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PROTEINS ASSOCIATED WITH CELL GROWTH, DIFFERENTIATION, AND DEATH 

TECHNICAL FIELD 

The invention relates to novel nucleic acids, proteins associated witlj ceil growth, 
5 differentiation, and death encoded by these nucleic acids, and to the use of these nucleic acids and 
proteins in the diagnosis, treatment, and prevention of cell proliferative disorders including cancer, 
developmental disorders, neurological disorders, autoimmune/inflammatory disorders, reproductive 
disorders, and disorders of the placenta. The invention also relates to the assessment of the effects of 
exogenous compounds on the expression of nucleic acids and proteins associated with cell growth, 
10 differentiation, and death. 

BACKGROUND OF THE INVENTION 

Human growth and development requires the spatial and temporal regulation of cell 
differentiation, cell proliferation, and apoptosis. These processes coordinately control reproduction, 

15 aging, embryogenesis, morphogenesis, organogenesis, and tissue repair and maintenance. At the 
cellular level, growth and development is governed by the cell's decision to enter into or exit from 
the cell division cycle and by the cell's commitment to a terminally differentiated state. These 
decisions are made by the cell in response to extracellular signals and other enviroimiental cues it 
receives. The following discussion focuses on the molecular mechanisms of cell division, 

20 embryogenesis, cell differentiation and proliferation, and apoptosis, as well as disease states such as 
cancer which can result from disruption of these mechanisms. 
Cell Cycle 

Cell division is the fundamental process by which all Uving things grow and reproduce. In 
unicellular organisms such as yeast and bacteria, each cell division doubles the number of organisms. 

25 In multicellular species many rounds of cell division are required to replace cells lost by wear or by 
programmed cell death, and for cell differentiation to produce a new tissue or organ. Progression 
through the cell cycle is governed by the intricate interactions of proteui conq^lexes. This regulation 
depends upon the appropriate expression of proteins which control cell cycle progression in response 
to extracellular signals, such as growth factors and other mitogens, and intracellular cues, such as 

30 DNA damage or nutrient starvation. Molecules which directly or indirectly modulate cell cycle 
progression fall into several categories, including cyclins, cyclin-dependent protein kinases, growth 
factors and their receptors, second messenger and signal transduction proteins, oncogene products, 
and tumor-suppressor proteins. 

Regulated progression of the cell cycle depends on the integration of growth control 

35 pathways with the basic cell cycle machinery. Cell cycle regulators have been identified by selecting 
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for human and yeast cDNAs that block or activate cell cycle arrest signals in the yeast mating 
pheromone pathway when they are overexpressed. Known regulators include human CPR (cell cycle 
progression restoration) genes, such as CPR8 and CPR2, and yeast CDC (cell division control) genes, 
including CDC91, that block the arrest signals. The CPR genes express a variety of proteins 
5 including cyclins, tumor suppressor bmding proteins, chaperones, transcription factors, translation 
factors, and RNA-binding protems (Edwards, M.C. et al.(1997) Genetics 147:1063-1076). 

Several cell cycle transitions, including the entry and exit of a cell from mitosis, are 
dependent upon the activation and inhibition of cyclin-dependent kinases (Cdks). The Cdks are 
composed of a kinase subunit, Cdk, and an activating subunit, cyclin, in a complex that is subject to 

10 many levels of regulation. There appears to be a single Cdk in Saccharomvces cerevisiae and 

Saccharomvces pombe whereas mammals have a variety of specialized Cdks. Cyclins act by binding 
to and activating cyclin-dependent protem kinases which then phosphorylate and activate selected 
proteins involved in the mitotic process. The Cdk-cyclin complex is both positively and negatively 
regulated by phosphorylation, and by targeted degradation involving molecules such as CDC4 and 

15 CDC53. In addition, Cdks are further regulated by binding to inhibitors and other proteins such as 
Sucl that modify thek specificity or accessibility to regulators (Patra, D. and W.G. Dunphy (1996) 
Genes Dev. 10:1503-1515; and Mathias, N. et al. (1996) Mol. Cell Biol. 16:6634-6643). 
Reproduction 

The male and female reproductive systems are complex and involve many aspects of growth 
20 and development The anatomy and physiology of the male and female reproductive systems are 

reviewed in (Guyton, A.C. (1991) Textbook of Medical PhvsiolQgv . W.B. Saunders Co., Philadelphia 
PA, pp. 899-928). 

The male reproductive system includes the process of spermatogenesis, in which the sperm 
are formed, and male reproductive functions are regulated by various hormones and their effects on 
25 accessory sexual organs, cellular metabolism, growth, and other bodily functions. 

Spermatogenesis begins at puberty as a result of stimulation by gonadotropic hormones 
released from the anterior pituitary. Immature sperm (spermatogonia) undergo several mitotic cell 
divisions before undergoiag meiosis and fiill maturation. The testes secrete several male sex 
hormones, the most abundant being testosterone, that is essential for growth and division of the 
30 unmature sperm, and for the masculine characteristics of the male body. Three other male sex 
hormones, gonadotropin-releasing hormone (GnRH), luteinizmg hormone (LH), and follicle- 
stimulating hormone (FSH) control sexual fiinction. 

The uterus, ovaries, fallopian tubes, vagina, and breasts comprise the female reproductive 
system. The ovaries and uterus are the source of ova and the location of fetal development, 
35 respectively. The fallopian tubes and vagina are accessory organs attached to the top and bottom of 
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the uterus, respectively. Both the uterus and ovaries have additional roles in the development and 
loss of reproductive capability during a female's lifetime. The primary role of the breasts is lactation. 
Multiple endocrine signals from the ovaries, uterus, pituitary, hypothalamus, adrenal glands, and 
other tissues coordinate reproduction and lactation. These signals vary during the monthly 
5 menstruation cycle and during the female's lifetime. Similarly, the sensitivity of reproductive organs 
to these endocrine signals varies during the female's lifetkne. 

A combmation of positive and negative feedback to the ovaries, pituitary and hypothalamus 
glands controls physiologic changes during the monthly ovulation and endometrial cycles. The 
anterior pituitary secretes two major gonadotropin hormones, follicle-stimulating honnone (FSH) and 
10 luteinizing hormone (LH), regulated by negative feedback of steroids, most notably by ovarian 
estradiol. If fertilization does not occur, estrogen and progesterone levels decrease. This sudden 
reduction of the ovarian hormones leads to menstruation, the desquamation of the endometrium. 

Hormones further govern all the steps of pregnancy, parturition, lactation, and menopause. 
During pregnancy large quantities of human chorionic gonadotropin (hCG), estrogens, progesterone, 
15 and human chorionic somatomammotropin (hCS) are formed by the placenta. hCG, a glycoprotein 
similar to luteinizing hormone, stimulates the corpus luteum to continue producing more progesterone 
and estrogens, rather than to involute as occurs if the ovum is not fertiUzed. hCS is similar to growth 
hormone and is crucial for fetal nutrition. 

The female breast also matures during pregnancy. Large amounts of estrogen secreted by the 
20 placenta trigger growth and branching of the breast milk ductal system while lactation is initiated by 
the secretion of prolactin by the pituitary gland. 

Parturition involves several hormonal changes that mcrease uterine contractility toward the 
end of pregnancy, as follows. The levels of estrogens increase more than those of progesterone. 
Oxytocin is secreted by the neurohypophysis. Concomitantly, uterine sensitivity to oxytocin 
25 increases. The fetus itself secretes oxytocin, Cortisol (from adrenal glands), and prostaglandins. 

Menopause occurs when most of the ovarian follicles have degenerated. The ovary then 
produces less estradiol, reducing the negative feedback on the pituitary and hypothalamus glands. 
Mean levels of circulating FSH and LH increase, even as ovulatory cycles continue. Therefore, the 
ovary is less responsive to gonadotropins, and there is an increase in the time between menstrual 
30 cycles. Consequently, menstrual bleeding ceases and reproductive capability ends. 
Cell Differentiation and Proliferation 

Tissue growth involves complex and ordered patterns of ceU proliferation, cell 
differentiation, and apoptosis. Cell proliferation must be regulated to maintain both the number of 
cells and their spatial organization. This regulation depends upon the appropriate expression of 
35 proteins which control cell cycle progression in response to extracellular signals, such as growth 
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factors and other mitogens, and intracellular cues, such as DNA damage or nutrient starvation. 
Molecules which directly or indirectly modulate cell cycle progression fall into several categories, 
including growth factors and their receptors, second messenger and signal transduction proteins, 
oncogene products, tumor-suppressor proteins, and mitosis-promoting factors. 
5 Growth factors were originally described as serum factors requited to promote cell 

proliferation. Most growth factors are large, secreted polypeptides that act on cells ia their local 
envu-onment. Growth factors bind to and activate specific cell surface receptors and initiate 
intracellular signal transduction cascades. Many growth factor receptors are classified as receptor 
tyrosine kinases which undergo autophosphorylation upon ligand binding. Autophosphorylation 

10 enables the receptor to interact with signal transduction proteins characterized by the presence of SH2 
or SH3 domains (Src homology regions 2 or 3). These proteuis then modulate the activity state of 
small G-proteins, such as Ras, Rab, and Rho, along with GTPase activating proteins (GAPs), guanine 
nucleotide releasmg protems (GNRPs), and other guaniae nucleotide exchange factors. Small G 
proteins act as molecular switches that activate other downstream events, such as mitogen-activated 

15 protein kinase (MAP kinase) cascades. MAP kinases ultimately activate transcription of mitosis- 
promoting genes. 

In addition to growth factors, small signaling peptides and hormones also influence cell 
proliferation. These molecules bind primarily to another class of receptor, the trimeric G-protein 
coupled receptor (GPCR), found predominantly on the surface of immune, neuronal and 

20 neuroendocrine cells. Upon ligand binding, the GPCR activates a trimeric G protein which in tum 
triggers increased levels of mtracellular second messengers such as phospholipase C, Ca2+, and 
cyclic AMP. Most GPCR-mediated signaling pathways mdirectly promote cell proliferation by 
causing the secretion or breakdown of other signalmg molecules that have direct mitogenic effects. 
These signalmg cascades often involve activation of kmases and phosphatases. Some growth factors, 

25 such as some members of the transforming growth factor beta (TGF-P) family, act on some cells to 
stimulate cell proliferation and on other cells to inhibit it. Growth factors may also stimulate a cell at 
one concentration and inhibit the same cell at another concentration. Most growth factors also have a 
multitude of other actions besides the regulation of cell grovrth and division: they can control the 
proliferation, survival, differentiation, migration, or function of cells dependmg on the circumstance. 

30 For example, the tumor necrosis factor/nerve growth factor (TNF/NGF) family can activate or inhibit 
cell death, as well as regulate proliferation and differentiation. The cell response depends on the type 
of cell, its stage of differentiation and transformation status, which surface receptors are stimulated, 
and the types of stimuli acting on the cell (Smith, A. et al. (1994) Cell 76:959-962; and Nocentini, G. 
et al. (1997) Proc. Natl. Acad. Sci. USA 94:6216-6221). 

35 Neighboring cells in a tissue compete for growth factors, and when provided with "unlimited" 
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quantities in a perfused system will grow to even higher cell densities before reaching density- 
dependent inhibition of cell division. Cells often demonstrate an anchorage dependence of cell 
division as well. This anchorage dependence may be associated with the formation of focal contacts 
linking the cytoskeleton with the extracellular matrix (ECM). The expression of ECM components 
5 can be stimulated by growth factors. For example, TGF-P stimulates fibroblasts to produce a variety 
of ECM protems, including fibronectin, collagen, and tenascin (Pearson, C.A. et al. (1988) EMBO J. 
7:2677-2981). In fact, for some cell types specific ECM molecules, such as laminin or fibronectin, 
may act as growth factors. Tenascin-C and -R, expressed in developing and lesioned neural tissue, 
provide stimulatory/anti-adhesive or inhibitory properties, respectively, for axonal growth (Faissner, 

10 A. (1997) CeU Tissue Res. 290:331-341). 

Cancers are associated with the activation of oncogenes which are derived from normal 
cellular genes. These oncogenes encode oncoproteins which convert normal cells into malignant 
cells. Some oncoproteins are mutant isoforms of the normal protein, and other oncoproteins are 
abnormally expressed with respect to location or amount of expression. The latter category of 

15 oncoprotein causes cancer by altering transcriptional control of cell proliferation. Five classes of 
oncoproteins are known to affect cell cycle controls. These classes include growth factors, growth 
factor receptors, intracellular signal transducers, nuclear transcription factors, and cell-cycle control 
proteins. Viral oncogenes are integrated into the human genome after infection of human cells by 
certain viruses. Examples of viral oncogenes include v-src, v-abl, and v-fps. Many cases related to 

20 the overexpression of proteins associated with tumors and metastasis have been reported. The Mtal 
gene has been cloned in mice, in both cell lines and tissues representing metastatic tumors (Simpson, 
A. et al. (2001) Gene 273:29-39), Expression of the melanoma antigen-encoding gene (MAGE) 
family of proteins has also been detected in many tumors. GACI, a new member of the leucine-rich 
repeat superfamily, is ampUfied and overexpressed in malignant gliomas (Almeida, A. et al. (1998) 

25 Oncogene 16:2997-3002). 

Many oncogenes have been identified and characterized. These include sis, erbA, erbB, her- 
2, mutated G,, src, abl, ras, crk, jun, fos, myc, and mutated tumor-suppressor genes such as RB, p53, 
mdm2, Cipl, pl6, and cyclin D. Transformation of normal genes to oncogenes may also occur by 
chromosomal translocation. The Philadelphia chromosome, characteristic of chronic myeloid 

30 leukemia and a subset of acute lymphoblastic leukemias, results from a reciprocal translocation 
between chromosomes 9 and 22 that moves a tnincated portion of the proto-oncogene c-abl to the 
breakpoint cluster region (bcr) on chromosome 22. 

Tumor-suppressor genes are involved in regulating cell proliferation. Mutations which cause 
reduced or loss of function in tumor-suppressor genes result in uncontrolled cell proliferation. For 

35 example, the retinoblastoma gene product (RB), in a non-phosphoiylated state, binds several early- 
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response genes and suppresses their transcription, thus blocking cell division. Phosphorylation of RB 
causes it to dissociate from the genes, releasing the suppression, and allowing cell division to 

proceed. 

SEB (SET-bindmg protein) is a novel nuclear protein that interacts in a yeast two-hybrid 
5 system and in human cells with SET, the translocation breakpoint-encoded protein in acute 

undifferentiated leukemia. SEB also has an oncoprotein Ski homologous region, six PEST suquences 
and three sequential PPLPPPPP repeats at the C-terminus. SEB mRNA is expressed ubiquitously in 
all examined human adult tissues and ceUs. SET has been mapped to chromosome 18q21.1. This 
reagon also contains tumor suppressor genes associated with deletions in cancer and leukemia 
10 (Minakuchi, M. et al. (2001) Eru. J. Biochem. 268:1340-1351). 

Angiogenesis is die process by which new capiUaries are formed by sprouting from 
preexisting vessels. It is a vital function for the grovrth of normal tissues during embryogenesis as 
well as for the pathological growth of tumors. Pathological proliferation fo cancer ceUs wUl not 
result in a proportional increse in mass without access to the blood circulation. Tumors form their 
15 own circulatory system by upregulating angiogenic stimulators and by downregulation angiogenesis 
inhibitors. The inability of metastases to induce an angiogenic response results in a dormant 
phenotype. Angiostatm is a circulating inhibitor of angiogenesis. In vitro, it inhibits endotheUal ceU 
migration, proliferation, and tube formation, and induces apoptosis in a cell type-specific manner. 
Angiomotin is an angiostatin-binding peptide that mediates angiostatin inhibition of migration and 
20 tube formation of endothelial cells. Angiomotin is expressed in the endothelial cells of capillaries 
and the larger vessels of the human placenta. Angiostatin inhibits ceU migration by interfering with 
angiomotin activity in endothelial cells (Troyanovsky, B. et al. (2001) J. Cell Biol. 152:1247-1254). 

Nucleolar protein pl20 is a proliferation-associated antigen expressed by cells in early Gl 
phase, identified by the monoclonal antibody FB. It is very cancer specific. In fact, quantitative 
25 immunohistochemical analysis of pl20 protein is an easy and reliable method for the assessment of 
clinical outcome and the definition of risk groups in oral carcinoma (Ventura, L. et al. (1999) 
Anticancer Res. 19:1423-1426). P120 contains a basic domam, an acidic domain, a hydrophobic and 
methionine-rich domain, and a domam rich m cysteme and prolme residues (Fonagy, A. et al. (1989) 
Cancer Conmiun. 1:243-251). This protein is expressed in early Gl and has not been detected in 
30 benign tumors and most normal resting tissues. Sato et al. show that the expression level of pl20 in 
tumor tissues can be used as an independent and powerful prognostic marker for resected lung 
adenocarcinoma (Sato, G. et al. (1999) J. Clin. Oncol. 17:2721-2727). 

The human LGIl gene is a leucine-rich, repeat-containmg gene that was cloned from the 
t(10:19) breakpoint of the T98G glioblastoma ceU line. The LGIl gene maps to 10q24, a region of 
peak LOH in malignant gliomas, and is inactivated during the transition from low to high-grade brain 



35 



6 



wo 03/050253 



PCTAJS02/39133 



tumors. The mouse Igil gene is 97% homologous to the human gene at the amino acid level and 91% 
homologous at the nucleotide level. LGH contains 8 exons, where each of the four leucine-rich 
repeat units is contained in an individual 72-bp exon. The cysteine-rich regions flanking the LRR and 
the single trans-membrane domain do not occupy individual exons (Somerville, R.P. et al. (2000) 
5 Mamm. Genome 11:622-627). 

Details of the cell division cycle may vary, but the basic process consists of three principle 
events. The first event, interphase, involves preparations for cell division, replication of the DNA, 
and production of essential proteins. In the second event, mitosis, the nuclear material is divided and 
separates to opposite sides of the ceU. The final event, cytokinesis, is division and fission of the ceU 
10 cytoplasm. The sequence and timing of cell cycle transitions is under the control of the cell cycle 
regulation system which controls the process by positive ca: negative regulatory cuxiuits at various 
check points. 

Mitosis marks the end of interphase and concludes with the onset of cytokinesis. There are 
four stages in mitosis, occurring in the following order: prophase, metaphase, anaphase and 

15 telophase. Prophase includes the formation of bi-polar mitotic spindles, composed of microtubules 
and associated proteins such as dynein, which originate fi-om polar mitotic centers. During 
metaphase, the nuclear material condenses and develops kinetochoie fibers which aid in its physical 
attachment to the mitotic spindles. The ensuing movement of the nuclear material to opposite poles 
along the mitotic spindles occurs during anaphase. Telophase includes the disappearance of the 

20 mitotic spindles and kinetochore fibers from the nuclear material. Mitosis depends on the interaction 
of numerous proteins. For example, centromere-associated proteins such as CENP-A, -B, and -C, 
play structural roles in kinetochore formation and assembly (Saffery, R. et al. (2000) Human Mol. 
Gen. 9:175-185). 

During the M phase of eukaryotic cell cycling, structural rearrangements occur ensuring 
25 appropriate distribution of cellular components between daughter cells. Breakdown of interphase 
structures into smaller subunits is common. The nuclear envelope breaks into vesicles, and nuclear 
lamins are disassembled. Subsequent phosphorylation of these lamins occurs and is maintained until 
telophase, at which time the nuclear lamina structure is reformed. cDNAs responsible for encoding 
M phase phosphorylation (MPPs) are components of U3 small nucleolar ribonucleoprotein (snoRNP), 
30 and relocalize to the nucleolus once mitosis is complete (Westendorf, J.M. et al. (1998) J. Biol. 
Chem. 9:437-449). U3 snoRNPs are essential mediators of RNA processing events. 

Proteins involved in the regulation of cellular processes such as mitosis include the Ser/Thr- 
protein phosphatases type 1 (PP-1). PP-ls act by dephosphorylation of key protems involved in the 
metaphase-anaphase transition. The gene PP1R7 encodes the regulatory polypeptide sds22, having at 
35 least six splice variants (Ceulemans, H. et al. (1999) Eur. J. Biochem. 262:36-42). Sds22 modulates 
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the activity of the catalytic subunit of PP-ls. and enhances the PP-1 -dependent dephosphorylation of 
mitotic substrates. 

Cell cycle regulatory proteins play an important role in cell proliferation and cancer. For 
example, failures in the proper execution and timing of ceU cycle events can lead to chromosome 
segregation defects resulting in aneuploidy or polyploidy. This genomic mstability is characteristic 
of transformed ceUs (Luca, F.C. and M. Winey (1998) Mol. Biol. CeU. 9:29-46). A recently 
identified protein, mMOBl, is the mammaUan homolog of yeast MOBl, an essential yeast gene 
required for completion of mitosis and maintenance of ploidy. The mammalian mMOBl is a member 
of protein complexes including protein phosphatase 2A (PP2A), and its phosphorylation appears to be 
regulated by PP2A (Moreno, C.S. et al. (2001) J. Biol. Chem. 276:24253-24260). PP2A has been 
implicated in the development of human cancers, including lung and colon cancers and leukemias. 

Cell cycle regulation involves numerous proteins interacting in a sequential manner. The 
eukaryotic cell cycle consists of several highly controUed events whose precise order ensures ' 
successful DNA replication and cell division. Cells maintain the order of these events by making 
later events dependent on the successful completion of earlier events. This dependency is enforced 
by cellular mechanisms called checkpoints. Examples of additional ceU cycle regulatory proteins 
include the histone deacetylases (HDACs). HDACs are involved in cell cycle regulation, and 
modulate chromatin structure. Human HDACl has been found to interact in vitro with the human 
Hus I gene product, whose Schizosaccharomyces pombe homolog has been hnplicated m G^M 
checkpoint control (Cai, R.L. et al. (2000) J. Biol. Chem. 275:27909-27916). 

DNA damage (Gj) and DNA replication (S-phase) checkpoints arrest eukaryotic cells at the 
transition. This arrest provides time for DNA repak or DNA repUcation to occur before entry 
into mitosis. Thus, tiie G/M checkpoint ensures that mitosis only occurs upon completion of DNA 
replication and in the absence of chromosomal damage. The Husl gene of Schizosaccharomyces 
pombe is a cell cycle checkpoint gene, as are the rad family of genes (e.g., radl and rad9) (Volkmer, 
E. and L.M. Kamitz (1999) J. Biol. Chem. 274:567-570; Kostmb C.F. et al. (1998) EMBO J. 
17:2055-2066). These genes are involved in the mitotic checkpoint, and are mduced by either DNA 
damage or blockage of replication. Induction of DNA damage or replication block leads to loss of 
function of the Husl gene and subsequent cell death. Human homologs have been identified for most 
of the rad genes, including ATM and ATR, the human homologs of rad3p. Mutations m the ATM 
gene are correlated with the severe congenital disease ataxia-telagiectasia (Savitsky, K. et ai. (1995) 
Science 268:1749-1753). The human Husl protem has been shown to act in a complex with radl 
protein which interacts with rad9, makmg them central components of a DNA damage-responsive 
protein complex of human cells (Volkmer and Kamitz, supra). 

The entry and exit of a cell from mitosis is regulated by the synthesis and destruction of a 
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35 



family of activating proteins called cyclins. Cyclins act by binding to and activating a group of 
cyclin-dependent protein kinases (Cdks) which then phosphorylate and activate selected proteins 
involved in the mitotic process. Cyclins are characterized by a large region of shared homology that 
is approximately 180 amino acids in length and refenred to as the "cyclin box" (Chapman, D.L. and 
DJ. WolgemuA (1993) Development 1 18:229-240). In addition, cyclins contain a conserved 9 amino 
acid sequence in the N-terminal region of the molecule called the "destruction box." This sequence is 
believed to be a recognition code that triggers ubiquitin-mediated degradation of cyclin B (Hunt, T. 
(1991) Nature 349: 100-101). Several types of cyclms exist (Qechanover, A. (1994) Cell 79:13-21). 
Progression through Gl and S phase is driven by the Gl cyclins and their catalytic subunits, including 
Cdk2-cycUn A, Cdk2-cyclin E, Cdk4-<:yclin D and Cdk6^yclin D. Progression through the G2-M 
transition is driven by the activation of mitotic CDK-cyclm complexes such as Cdc2^yclin A, 
Cdc2-<:yclin B 1 and Cdc2-cyclin B2 complexes (reviewed in Yang, J. and S. Kombluth (1999) 
Trends CeU Biol. 9:207-210). 

Cyclins are degraded through the ubiquitin conjugation system (UCS), a major pathway for 
15 the degradation of cellular proteins in eukaroytic cells and in some bacteria. The UCS mediates the 
elimination of abnormal proteins and regulates the half-lives of unportant regulatory proteins that 
control cellular processes such as gene transcription and cell cycle progression. The UCS is 
imphcated in the degradation of mitotic cyclin kinases, oncoproteins, tumor suppressor genes such as 
p53, viral proteins, cell surface receptors associated with signal transduction, transcriptional 
20 regulators, and mutated or damaged proteins (Ciechanover, supra). 

The process of ubiquitin conjugation and protem degradation occurs in five principle steps 
(Jentsch, S. (1992) Annu. Rev. Genet. 26: 179-207). First ubiquitin (Ub), a smaU. heat stable protein 
is activated by a ubiquitin-activating enzyme (El) in an ATP dependent reaction which binds the C- 
terminus of Ub to the tiiiol group of an mtemal cysteine residue in El. Second, activated Ub is 
tiransferred to one of several Ub-conjugating enzymes (E2). Different ubiquitin-dependent 
proteolytic padiways employ structiirally similar, but distinct ubiquitin-conjugating enzymes that are 
associated with recognition subunits which direct them to proteins carrying a particular degradation 
signal. Third, E2 ti^sfers die Ub molecule dirough its C-terminal glycme to a member of the 
ubiquitin-protem ligase family, E3. Fourtii, E3 transfers the Ub molecule to the target protein. 
Additional Ub molecules may be added to the target protein forming a multi-Ub chain structure. 
Fiftii, the ubiquinated protein is then recognized and degraded by die proteasome, a large, 
multisubunit proteolytic enzyme complex, and Ub is released for re-utihzation. 

Prior to activation, Ub is usuaUy expressed as a fusion protem composed of an N-tenninal 
ubiquitin and a C-terminal extension protein (CEP) or as a polyubiquitin protein with Ub monomers 
attached head to taU. CEPs have characteristics of a variety of regulatory proteins; most are highly 
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basic, contain up to 30% lysine and arginine residues, and have nucleic acid-binding domains 
(Monia, B.P. et al. (1989) J. Biol. Chem. 264:4093-4103). The fusion protein is an important 
intermediate which appears to mediate co-regulation of the cell's translational and protein 
degradation activities, as well as localization of the inactive enzyme to specific cellular sites. Once 
5 delivered, C-terminal hydrolases cleave the fusion protein to release a fimctional Ub (Monia et al., 
supra). 

Ub-conjugating enzymes (E2s) are important for substrate specificity in different UCS 
pathways. All E2s have a conserved domain of approximately 16 kDa called the UBC domain that is 
at least 35% identical in all E2s and contains a centrally located cysteine residue required for 

10 ubiquitin-enzyme thiolester formation (Jentsch, supra). A well conserved proline-rich element is 
located N-terminal to the active cysteine residue. Structural variations beyond this conserved domain 
are used to classify the E2 enzymes. Class I E2s consist ahnost exclusively of the conserved UBC 
domain. Class n E2s have various unrelated C-terminal extensions that contribute to substrate 
specificity and cellular localization. Class III E2s have unique N-tenninal extensions which are 

15 believed to be involved in enzyme regulation or substrate specificity. 

A mitotic cyclm-specific E2 (E2-C) is characterized by the conserved UBC domain, an N- 
terminal extension of 30 amino acids not found in other E2s, and a 7 amino acid unique sequence 
adjacent to this extension. These characteristics together with the high affinity of E2-C for cyclin 
identify it as a new class of E2 (Aristarkhov, A. et al. (1996) Proc. Nad. Acad. Sci. 93:4294-99). 

20 Ubiquitin-protein ligases (E3s) catalyze the last step in the ubiquitin conjugation process, 

covalent attachment of ubiquitin to the substrate. E3 plays a key role in determining the specificity of 
the process. Only a few E3s have been identified so far. One type of E3 ligases is the HECT 
(homologous to E6-AP C-terminus) domam protein family. One member of the family, E6-AP 
(E6-associated protein) is required, along with the human papillomavirus (HPV) E6 oncoprotein, for 

25 the ubiquitination and degradation of p53 (Schef&ier, M. et al. (1993) Cell 75:495-505). The 
C-terminal domain of HECT proteins contains the highly conserved ubiquitin-bmding cysteine 
residue. The N-terminal region of the various HECT proteins is variable and is believed to be 
mvolved in specific substrate recognition (Huibregtse, J.M. et al. (1997) Proc. Natl Acad. Sci. USA 
94:3656-3661). The SCF (Skpl-Cdc53/Cullin-Fbox receptor) family of proteins comprise another 

30 group of ubiquitin ligases (Deshaies, R. (1999) Annu. Rev. Dev. Biol. 15:435-467). Multiple proteins 
are recruited into the SCF complex, includmg Skpl, cullin, and an F box domain containing protein. 
The F box protein binds the substrate for the ubiquitination reaction and may play roles in 
determining substrate specificity and orienting the substrate for reaction. Skpl interacts with both the 
F box protein and cullin and may be involved in positioning the F box protem and cullin in the 

35 complex for transfer of ubiquitin fi^om the E2 enzyme to the protein substrate. Substrates of SCF 
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ligases include proteins involved in regulation of CDK activity, activation of transcription, signal 
transduction, assembly of kinetochores, and DNA replication. 

Sgtl was identified in a screen for genes in yeast that suppress defects in kinetochore 
function caused by mutations in Skpl (Kitagawa, K. et al. (1999) Mol. Cell 4:21-33). Sgtl interacts 
5 with Skp 1 and associates with SCF ubiquitin ligase. Defects in Sgtl cause arrest of cells at either Gl 
or G2 stages of the cell cycle. A yeast Sgtl null mutant can be rescued by human Sgtl, an indication 
of the conservation of Sgtl function across species. Sgtl is required for assembly of kinetochore 
complexes in yeast. 

Abnormal activities of the UCS are iinplicated in a number of diseases and disorders. These 
10 include, e.g., cachexia (Llovera, M. et al. (1995) Int. J. Cancer 61:138-141), degradation of the 
tumor-suppressor protein, p53 (Ciechanover, supra), and neurodegeneration such as observed in 
Alzheimer's disease (Oregon, L. et al. (1994) Biochem. Biophys. Res. Commun. 203:1731-1738). 
Since ubiquitin conjugation is a rate-limiting step in antigen presentation, the ubiquitin degradation 
pathway may also have a critical role in the immune response (Grant, E.P. et al. (1995) J. Immunol. 
15 155:3750-3758). 

Certain cell proliferation disorders can be identiBed by changes m the protein complexes that 
normally control progression through the cell cycle. A primary treatment strategy involves 
reestablishing control over cell cycle progression by manipulation of the proteins involved in cell 
cycle regulation (Nigg, E.A. (1995) BioEssays 17:471-480). 

20 Embryogenesis 

Mammalian embryogenesis is a process which encompasses the first few weeks of 
development following conception. During this period, embryogenesis proceeds from a single 
fertilized egg to the formation of the three embryonic tissues, then to an embryo which has most of its 
internal organs and all of its external features. 

23 The normal course of mammalian embryogenesis depends on the correct temporal and spatial 

regulation of a large number of genes and tissues. These regulation processes have been intensely 
studied in mouse. An essential process that is still poorly understood is the activation of the 
embryonic genome after fertilization. As mouse oocytes grow, they accumulate transcripts that are 
either translated directly into proteins or stored for later activation by regulated polyadenylation. 

30 During subsequent meiotic maturation and ovulation, the maternal genome is transcriptionally inert, 
and most maternal transcripts are deadenylated and/or degraded prior to, or together with, the 
activation of the zygotic genes at the two-cell stage (Stutz, A. et al. (1998) Genes Dev. 12:2535- 
2548). The maternal to embryonic transition involves the degradation of oocyte, but not zygotic 
transcripts, the activation of die embryonic genome, and the induction of cell cycle progression to 

35 accommodate early development. 
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MATER (Maternal Antigen That Embryos Require) was initially identified as a target of 
antibodies from mice with ovarian immunity (Tong, Z-B. and L.M. Nelson (1999) Endocrinology 
140:3720-3726). Expression of the gene encoding MATER is restricted to the oocyte, making it one 
of a limited number of known maternal-effect genes m mammals (Tong, Z-B. et al. (2000) Mamm. 
Genome 1 1 :28 1-287). The MATER protein is required for embryonic development beyond two ceUs, 
based upon preliminary results from mice in which this gene has been inactivated. The 1111-amino 
acid MATER protein contains a hydrophilic repeat region in the amino terminus, and a region 
containing 14 leucine-rich repeats in the carboxyl tennmus. These repeats iBsemble the sequence 
found in porcine ribonuclease inhibitor that is critical for protein-protein interactions. 

The degradation of maternal transcripts during meiotic maturation and ovulation may involve 
the activation of a ribonuclease just prior to ovulation. Thus the function of MATER may be to bind 
to the maternal ribonuclease and prevent degradation of zygotic transcripts (Tong et al., supra). In 
addition to its role in oocyte development and embryogenesis, MATER may also be relevant to the 
pathogenesis of ovarian immunity, as it is a target of autoantibodies in mice with autoimmune 
oophoritis (Tong and Nelson, supra). 

The maternal nnRNA D7 is a moderately abundant transcript in Xenopus laevis whose 
expression is highest in, and perhaps restricted to, oogenesis and early embryogenesis. The 
D7 protein is absent from oocytes and first begins to accumulate during oocyte maturation. Its levels 
are highest during the furst day of embryonic development and then they decrease. The loss of D7 
protein affects the maturation process itself, significantly delaymg the time course of germinal vesicle 
breakdown. Thus, D7 is a newly described protein involved in oocyte maturation (Smith, R.C. et al. 
(1988) Genes Dev. 2(10): 1296-306.) 

Many other genes are involved in subsequent stages of embryogenesis. After fertilization, the 
oocyte is guided by fimbria at the distal end of each fallopian tube mto and through the fallopian tube 
and thence into the uterus. Changes in the uterine endometrium prepare the tissue to support the 
implantafion and embryonic development of a fertilized ovum. Several stages of division have 
occurred before the dividing ovum, now a blastocyst with about 100 cells, enters the uterus. Upon 
reaching the uterus, the developing blastocyst usually remams in the uterine cavity an additional two 
to four days before implanting in the endometrium, the inner lining of the uterus. Implantation results 
from the action of trophoblast cells that develop over the surface of the blastocyst. These cells 
secrete proteolytic enzymes that digest and liquefy the cells of the endometrium. The mvasive 
process is reviewed in Fisher, S.J. and C.H. Damsky (1993; Semin Cell Biol 4:183-188) and Graham, 
C.H. and P.K. Lala (1992; Biochem Cell Biol 70:867-874). Once implantation has taken place, the 
trophoblast and other sublying cells proUferate rapidly, forming the placenta and the various 
membranes of pregnancy. (See Guyton, A.C. (1991) Textbook of Medical Physiology . 8* ed. W.B. 
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Saunders Company, Philadelphia PA, pp. 915-919.) 

The placenta has an essential role in protecting and nourishing the developing fetus. In most 
species the syncytiotrophoblast layer is present on the outside of the placenta at the fetai-matemal 
interface. This is a continuous structure, one cell deep, formed by the fusion of the constituent 
5 trophoblast cells. The syncytiotrophoblast cells play important roles in maternal-fetal exchange, in 
tissue remodeling during fetal development, and in protecting the developing fetus from the maternal 
immune response (Stoye, J.P. and J.M. Coffin (2000) Nature 403:715-717). 

A gene called syncytin is the envelope gene of a human endogenous defective provhrus. 
Syncytin is expressed in high levels in placenta, and more weakly m testis, but is not detected in any 

10 other tissues (Mi, S. et al. (2000) Nature 403:785-789). Syncytin expression m the placenta is 

restricted to the syncytiotrophoblasts. Since retroviral env proteins are often involved m promoting 
ceil fusion events, it was thought that syncytin might be involved in regulating the fusion of 
trophoblast cells into the syncytiotrophoblast layer. Experiments demonstrated that syncytin can 
mediate cell fusion in vitro, and that anti-syncytui antibodies can inhibit the fusion of placental 

15 cytotrophoblasts (Mi et al, supra). In addition, a conserved immunosuppressive domain present in 
retroviral envelope protems, and found in syncytin at amino acid residues 373-397, might be involved 
in preventing maternal immune responses against the developing embryo. 

Syncytin may also be involved in regulating trophoblast invasiveness by inducing trophoblast 
fusion and tenninal differentiation (Mi et al., supra), hisufficient trophoblast infiltration of the 

20 uterine wall is associated with placental disorders such as preeclampsia, or pregnancy induced 
hypertension, while uncontrolled trophoblast invasion is observed m choriocarcinoma and other 
gestational trophoblastic diseases. Thus syncytin function may be involved in these diseases. 
CeU Differentiation 

Multicellular organisms are comprised of diverse cell types that differ dramatically both in 
25 structure and function, despite the fact that each cell is like the others in its hereditary endowment. 
Cell differentiation is the process by which cells come to dififer in their structure and physiological 
function. The cells of a multicellular organism all arise from mitotic divisions of a smgle-ceUed 
zygote. The zygote is totipotent, meaning that it has the ability to give rise to every type of cell in the 
adult body. During development the cellular descendants of the zygote lose their totipotency and 
30 become determined. Once its prospective fate is achieved, a cell is said to have differentiated. All 
descendants of this cell will be of the same type. 

Human growth and development requires the spatial and temporal regulation of cell 
differentiation, along with cell proliferation and regulated cell death. These processes coordinate to 
control reproduction, aging, embryogenesis, morphogenesis, organogenesis, and tissue repair and 
35 maintenance. The processes involved in cell difiFerentiation are also relevant to disease states such as 
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cancer, in which case the factors regulating normal cell differentiation have been altered, allowing the 
cancerous cells to proliferate in an anaplastic, or undifferentiated, state. 

The mechanisms of differentiation involve cell-specific regulation of transcription and 
translation, so that different genes are selectively expressed at different times in different cells. 
5 Genetic experiments using the fruit fly Drosophila melcmogaster have identified regulated cascades 
of transcription factors which control pattern formation during development and differentiation. 
These include the homeotic genes, which encode transcription factors containing homeobox motifs. 
The products of homeotic genes determine how the insect's imaginal discs develop from masses of 
undifferentiated cells to specific segments containing complex organs. Many genes found to be 

10 mvolved in cell differentiation and development m Drosophila have homologs in mammals. Some 
human genes have equivalent developmental roles to their Drosophila homologs. The human 
homolog of the Drosophila eyes absent gene (eya) underlies branchio-oto-renal syndrome, a 
developmental disorder affectmg the ears and kidneys (Abdelhak, S. et al. (1997) Nat. Genet 15:157- 
164). The Drosophila slit gene encodes a secreted leucine-rich repeat containing protein expressed 

15 by the midline glial cells and required for nonnal neural development. 

At the cellular level, growth and development are govemed by the cell's decision to enter 
into or exit from the cell cycle and by the cell's conraiitment to a terminally differentiated state. 
Differential gene expression within cells is triggered in response to extracellular signals and other 
environmental cues. Such signals include growth factors and other mitogens such as retinoic acid; 

20 cell-cell and cell-matrix contacts; and environmental factors such as nutritional signals, toxic 

substances, and heat shock. Candidate genes that may play a role in differentiation can be identified 
by altered expression patterns upon induction of cell differentiation in vitro. 

The final step in cell differentiation results in a specialization that is characterized by the 
production of particular proteins, such as contractile protems in muscle cells, serum proteins in liver 

25 cells and globhis in red blood cell precursors. The expression of these specialized proteins depends at 
least in part on cell-specific transcription factors. For example, the homeobox-containing 
transcription factor PAX-6 is essential for early eye determination, specification of ocular tissues, and 
normal eye development in vertebrates. 

In the case of epidermal differentiation, the induction of differentiation-specific genes occurs 

30 either together with or following growth arrest and is believed to be linked to the molecular events 
that control irreversible growth arrest. Icrcversible growth arrest is an early event which occurs when 
cells transit from the basal to the innermost suprabasal layer of the skin and begin expressing 
squamous-specific genes. These genes include those involved in the formation of the cross-linked 
envelope, such as transglutaminase I and DI, involuciin, loricin, and small proline-rich repeat (SPRR) 
35 proteins. The SPRR proteins are 8-10 kDa in molecular mass, rich in proline, glutamme, and 
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cysteine, and contain similar repeating sequence elements. The SPRR proteins may be structural 
proteins with a strong secondary structure or metal-binding proteins such as metallothioneins. 
(Jetten, A.M. and B.L. Harvat (1997) J. Dermatol. 24:711-725; PRINTS Entry PR00021 PRORICH 
Small proline-rich protein signature.) 
5 The Wnt gene family of secreted signaling molecules is highly conserved throughout 

eukaryotic cells. Members of the Wnt family are involved in regulating chondrocyte differentiation 
within the cartilage template. Wnt-5a, Wnt-5b and Wnt-4 genes are expressed in chondrogenic 
regions of the chicken Umb, Wnt-5a being expressed in the perichondrium (mesenchymal cells 
immediately surrounding the early cartilage template). Wnt-5a misexpression delays the maturation 

10 of chondrocytes and the onset of bone collar formation m chicken lunb (Hartmann, C. and CJ. Tabin 
(20(K)) Development 127:3141-3159). 

Glypicans are a family of cell surface heparan sulfate proteoglycans that play an important 
role in cellular growth control and differentiation. Cerebroglycan, a heparan sulfate proteoglycan 
expressed in the nervous system, is involved with the motile behavior of developing neurons (Stipp, 

15 C.S. et al. (1994) J. Cell Biol. 124:149-160). 

Notch plays an active role in the differentiation of glial cells, and influences the length and 
organization of neuronal processes (for a review, see Frisen, J. and U. Lendahl (2001) Bioessays 
23:3-7). The Notch receptor signaling pathway is unportant for morphogenesis and development of 
many organs and tissues m multicellular species. Drosophila fringe proteins modulate the activation 

20 of the Notch signal transduction pathway at the dorsal- ventral boundary of the wing imaginal disc. 
Mammalian fringe-related family members participate in boundary determination during 
segmentation (Johnston, S.H. et al. (1997) Development 124:2245-2254). 

Recently a number of proteins have been found to contain a conserved cysteine-rich domain 
of about 60 amino-acid residues called the IJM domam (for Lui-1 1 Isl-1 Mec-3) (Freyd, G. et al. 

25 (1990) Nature 344:876-879; Baltz, R. et al. ( 1992) Plant Ceil 4: 1465-1466). In the LIM domam, there 
are seven conserved cysteine residues and a histidine. The LM domain binds two zinc ions 
(Michelsen, J.W. et al. (1993) Proc. Natl. Acad. Sci. U.S.A. 90:4404-4408). LIM does not bind 
DNA; rather, it seems to act as an interface for protein-protein interaction. 
Apoptosis 

30 Apoptosis is the geneticaOy controlled process by which unneeded or defective cells undergo 

programmed cell death. Selective elimination of cells is as important for morphogenesis and tissue 
remodeling as is ceU proliferation and differentiation. Lack of apoptosis may result in hyperplasia 
and other disorders associated with increased cell proliferation. Apoptosis is also a critical 
component of the immune response. Immune cells such as cytotoxic T-cells and natural killer cells 

35 prevent the spread of disease by inducing apoptosis in tumor cells and vkus-infected cells. In 
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addition, immune cells that fail to distinguish self molecules from foreign molecules must be 
eliminated by apoptosis to avoid an autoimmune response. 

Apoptotic cells undergo distinct morphological changes. Hallmarks of apoptosis include cell 
shrinkage, nuclear and cytoplasmic condensation, and alterations in plasma membrane topology. 
5 Biochemically, apoptotic cells are characterized by increased intracellular calcium concentration, 
fragmentation of chromosomal DNA, and expression of novel cell surface components. 

The molecular mechanisms of apoptosis are highly conserved, and many of the key protein 
regulators and effectors of apoptosis have been identified. Apoptosis generally proceeds in response 
to a signal which is transduced intracellularly and results in altered patterns of gene expression and 
10 protein activity. Signaling molecules such as hormones and cytokines are known both to stimulate 
and to inhibit apoptosis through interactions with cell surface receptors. Transcription factors also 
play an important role in the onset of apoptosis. A number of downstream effector molecules, 
especially proteases, have been implicated in the degradation of cellular components and the 
proteolytic activation of other apoptotic effectors. 

15 The Bcl-2 family of proteins, as well as other cytoplasmic proteins, are key regulators of 

apoptosis. There are at least 15 Bcl-2 family members within 3 subfamilies. These proteins have 
been identified in mammalian cells and in viruses, and each possesses at least one of four Bcl-2 
homology domains (BHl to BH4), which are highly conserved. Bcl-2 family proteins contain the 
BHl and BH2 domains, which are found in members of the pro-survival subfamily, while those 

20 proteins which are most similar to Bcl-2 have all four conserved domains, enabling mhibition of 
apoptosis following encounters with a variety of cytotoxic challenges. Members of the pro-survival 
subfamily include Bcl-2, Bcl-x^, Bcl-w, Mcl-l, and Al in mammals; NF-13 (chicken); CED-9 
(Caenorhabditis elegans); and viral protems BHRFl, LMW5-HL, ORF16, KS-Bcl-2, and E1B-19K. 
The BH3 domain is essential for the fimction of pro-apoptosis subfamily proteins. The two pro- 

25 apoptosis subfamilies, Bax and BH3, melude Bax, Bak, and Bok (also called Mtd); and Bik, Blk, Hrk, 
BNIP3, BmiL, Bad, Bid, and Egl-1 (C. elegatis); respectively. Members of the Bax subfamily contain 
the BHl, BH2, and BH3 domams, and resemble Bcl-2 rather closely. In contrast, members of the 
BH3 subfamily have only the 9-16 residue BH3 domain, being otherwise unrelated to any known 
protein, and only Bik and BUc share sequence similarity. The proteins of the two pro-apoptosis 

30 subfamilies may be the antagonists of pro-survival subfamily proteins. This is illustrated m C. 
elegans where Egl-1, which is required for apoptosis, binds to and acts via CED-9 (for review, see 
Adams, J.M. and S. Cory (1998) Science 281:1322-1326). 

Heterodimerization between pro-apoptosis and anti-apoptosis subfamily proteins seems to 
have a titrating effect on the fiinctions of these protein subfamilies, which suggests that relative 

35 concentrations of the members of each subfamily may act to regulate apoptosis. Heterodimerization 
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is not required for a pro-survival protein; however, it is essential in the BBB subfamily, and less so in 

the Bax subfamily. 

The Bcl-2 protein has 2 isoforms, alpha and beta, which are formed by alternative splicing. It 
forms homodiniers and heterodimers with Bax and Bak proteins and the Bcl-X isoform Bcl-Xj. 
5 Heterodimerization with Bax requires intact BHl and BH2 domains, and is necessary for pro-survival 
activity. The BH4 domain seems to be involved in pro-survival activity as well. BcI-2 is located 
within the inner and outer mitochondrial membranes, as well as within the nuclear envelope and 
endoplasmic reticulum, and is expressed in a variety of tissues. Its involvement in follicular 
lymphoma (type n chronic lymphatic leukemia) is seen in a chromosomal translocation T(14;18) 

10 (q32;q21) and involves immimoglobulin gene regions. 

The Bcl-x protein is a dominant regulator of apoptotic cell death. Alternative splicing results 
in three isoforms, Bcl-xB, a long isoform, and a short isoform. The long isoform exhibits cell death 
repressor activity, while the short isoform promotes apoptosis. BcI-xL forms heterodimers with Bax 
and Bak, although heterodimerization with Bax does not seem to be necessary for pro-survival (anti- 

15 apoptosis) activity. Bcl-xS forms heterodimers with Bcl-2. Bcl-x is found in mitochondrial 
membranes and the perinuclear envelope. Bcl-xS is expressed at high levels in developing 
lymphocytes and other cells undergoing a high rate of turnover. Bcl-xL is found in adult brain and in 
other tissues' long-lived post-mitotic cells. As with Bcl-2, the BHl, BH2, and BH4 domains are 
involved in pro-survival activity. 

20 The Bcl-w protein is found within the cytoplasm of almost all myeloid cell lines and in 

numerous tissues, with the highest levels of expression in brain, colon, and salivary gland. This 
protein is expressed in low levels in testis, liver, heart, stomach, skeletal muscle, and placenta, and a 
few lymphoid cell lines. Bcl-w contains the BHl, BH2, and BH4 domains, all of which are needed 
for its cell survival promotion activity. Although mice in which Bcl-w gene function was disrupted 

25 by homologous recombination were viable, healthy, and normal in appearance, and adult females had 
normal reproductive function, the adult males were infertile. In these males, the initial, prepuberty 
stage of spermatogenesis was largely unaffected and the testes developed normally. However, the 
seminiferous tubules were disorganized, contained niunerous apoptotic cells, and were incapable of 
producing mature sperm. This mouse model may be applicable to sohk cases of human male sterility 

30 and suggests that alteration of programmed cell death in the testes may be useful in modulating 
fertility (Print, C.G. et al. (1998) Proc. Nad. Acad. Sci. USA 95: 12424-12431). 

Studies in rat ischemic brain found Bcl-w to be overexpressed relative to its normal low 
constitutive level of expression in nonischemic brain. Furthermore, in vitro studies to examme the 
mechanism of action of Bcl-w revealed that isolated rat brain mitochondria were unable to respond to 

35 an addition of recombinant Bax or high concentrations of calcium when Bcl-w was also present. The 
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nonnal response would be the release of cytochronK c from the mitochondria. Additionally, 
recombinant Bcl-w protein was found to inhibit calcium-induced loss of mitochondrial 
transmembrane potential, which is iudicative of permeability transition. Together these fmdmgs 
suggest that Bcl-w may be a neuro-protectant agamst ischemic neuronal death and may achieve this 
protection via the mitochondrial death-regulatory pathway (Yan, C. et al. (2000) J. Cereb. Blood Flow 
Metab. 20:620-630). 

The bfl-1 gene is an additional member of the Bcl-2 family, and is also a suppressor of 
apoptosis. The Bfl-1 protein has 175 amino acids, and contains the BHl, BH2, and BH3 conserved 
domains found in BcI-2 family members. It also contains a Gin-rich NBE-terminal region and lacks 
an NH domain 1, unlike other Bcl-2 family members. The mouse Al protein shares high sequence 
homology with Bfl-1 and has the 3 conserved domams found in Bfl-1. Apoptosis induced by the p53 
tumor suppressor protein is suppressed hy^ Bfl-1, sraiilar to the action of Bcl-2, Bcl-xL, and EBV- 
BHRFl (D'Sa-Eipper, C. et al. (1996) Cancer Res. 56:3879-3882). Bfl-1 is found intracellularly. 
with the highest expression in the hematopoietic compartment, i.e. blood, spleen, and bone marrow; 
moderate expression in lung, small mtestine, and testis; and minimal expression in other tissues. It is 
also found in vascular smooth muscle cells and hematopoietic malignancies. A correlation has been 
noted between the expression level of bfl-1 and the development of stomach cancer, suggesting that 
the Bfl-1 protein is involved m the development of stomach cancer, either in the promotion of 
cancerous cell survival or in cancer (Choi, S.S. et al. (1995) Oncogene 1 1: 1693-1698). 

Cancers are characterized by continuous or uncontrolled cell proliferation. Some cancers are 
associated with suppression of normal apoptotic cell death. Strategies for treatment may mvolve 
either reestabUshmg control over cell cycle progression, or selectively stimulating apoptosis in 
cancerous cells (Nigg, E.A. (1995) BioEssays 17:471-480). hnmunological defenses against cancer 
include induction of apoptosis in mutant cells by tumor suppressors, and the recognition of tumor 
antigens by T lymphocytes. Response to mitogenic stresses is frequently controlled at the level of 
transcription and is coordinated by various transcription factors. For example, the RelTNF-kappa B 
family of vertebrate transcription factors plays a pivotal role in inflammatory and immune responses 
to radiation. The NF-kappa B family includes p50, p52, RelA, RelB, cRel, and other DNA-bmding 
protems. The p52 protem induces apoptosis, upregulates the transcription factor c-Jun, and activates 
c-Jun N-terminal kinase 1 (JNKl) (Sun, L. et al. (1998) Gene 208: 157-166). Most NF-kappa B 
protems form DNA-binding homodimers or heterodimers. Dimerization of many transcription factors 
is mediated by a conserved sequence known as the bZIP domam, characterized by a basic region 
followed by a leucine zipper. 

The Fas/Apo-I receptor (FAS) is a member of the tumor necrosis factor (TNF) receptor 
family. Upon binding its ligand (Fas ligand), the membrane-spanning FAS induces apoptosis by 
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recruiting several cytoplasmic proteins that transmit the death signal. One such protein, termed FAS- 
associated protein factor 1 (FAFl), was isolated from mice, and it was demonstrated that expression 
of FAFl in L cells potentiated FAS-induced apoptosis (Chu, K. et al. (1995) Proc. Natl. Acad. Sci. 
USA 92: 1 1894-11898). Subsequently, FAS-associated factors have been isolated from numerous 
5 other species, including fruit fly and quaU (Frohlich, T. et al. (1998) J. Cell Sci. 1 1 1:2353-2363). 
Another cytoplasmic protein that functions in the transmittal of the death signal from Fas is the Fas- 
associated death domain protein, also known as FADD. FADD transmits the death signal in both 
FAS-mediated and TNF receptor-mediated apoptotic pathways by activating caspase-8 (Bang, S. et al. 
(2000) J. Biol. Chem. 275:36217-36222). 

10 Fragmentation of chromosomal DNA is one of the hallmarks of apoptosis. DNA 

fragmentation factor (DFF) is a protein composed of two subunits, a 40-kDa caspase-activated 
nuclease termed DFF40/CAD, and its 45-kDa inhibitor DFF45/ICAD. Two mouse homologs of 
DFF45/1CAD, termed CIDE-A and CIDE-B, have recently been described (Inohara, N. et al. (1998) 
EMBO J. 17:2526-2533). ClDE-A and CIDE-B expression in mammalian cells activated apoptosis, 

15 while expression of CIDE-A alone induced DNA fragmentation. In addition, FAS-mediated 

apoptosis was enhanced by CIDE-A and CIDE-B, ftirther implicatmg these proteins as effectors that 
mediate apoptosis. 

Transcription factors play an important role in the onset of apoptosis. A number of 
downstream effector molecules, particularly proteases such as the cysteine proteases called caspases, 

20 are involved in the initiation and execution phases of apoptosis. The activation of the caspases results 
from the competitive action of the pro-survival and pro-apoptosis Bcl-2-reIated proteins (Print, CO. 
et al. (1998) Proc. Natl. Acad. Sci. USA 95: 12424-12431). A pro-apoptotic signal can activate 
initiator caspases that trigger a proteolytic caspase cascade, leadmg to the hydrolysis of target 
proteins and the classic apoptotic death of the cell. Two active site residues, a cysteme and a 

25 histidine, have been implicated in the catalytic mechanism. Caspases are among the most specific 
endopeptidases, cleaving after aspartate residues. 

Caspases are synthesized as inactive zymogens consisting of one large (p20) and one small 
(plO) subunit separated by a small spacer region, and a variable N-terminal prodomain. This 
prodomain interacts with cofactors that can positively or negatively affect apoptosis. An activating 

30 signal causes autoproteolytic cleavage of a specific aspartate residue (D297 in the caspase-1 

numbering convention) and removal of the spacer and prodomain, leaving a pl0/p20 heterodimer. 
Two of these heterodimers interact via their small subunits to form the catalytically active tetramer. 
The long prodomains of some caspase family members have been shown to promote dimerization and 
auto-processing of procaspases. Some caspases contain a "deadi effector domam" in their prodomain 

35 by which they can be recruited into self-activating complexes with other caspases and FADD protein- 
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associated death receptors or the TNF receptor complex. In addition, two dimers from different 
caspase family members can associate, changing the substrate specificity of the resultant tetramer. 

Tumor necrosis factor (TNF) and related cytokines induce apoptosis in lynq)hoid cells. 
(Reviewed in Nagata, S. ( 1997) Cell 88:355-365.) Binding of TNF to its receptor triggers a signal 
transduction pathway that results in the activation of a proteolytic caspase cascade. One such 
caspase, ICE (Interleukin-ip converting enzyme), is a cysteine protease comprised of two large and 
two small subunits generated by ICE auto-cleavage (Dinarello, C.A. (1994)FASEB J. 8:1314-1325). 
ICE is expressed primarily in monocytes. ICE processes the cytokine precursor, interieukin-lp, into 
its active form, which plays a central role m acute and chronic inflammation, bone resorption, 
myelogenous leukemia, and other pathological processes. ICE and related caspases cause apoptosis 
when overexpressed in transfected cell lines. 

A caspase recruitment domain (CARD) is found within the prodomain of several apical 
caspases and is conserved m several apoptosis regulatory molecules such as Apaf-2, RAIDD, and 
cellular inhibitors of apoptosis proteins (lAPs) (Hofmann, K. et al. (1997) Trends Biochem. Sci. 
22:155-157). The regulatory role of CARD in apoptosis may be to allow proteins such as Apaf-1 to 
associate with caspase-9 (Li, P. et al. (1997) Cell 91:479-489). A human cDNA encoding an 
apoptosis repressor with a CARD (ARC) which is expressed in both skeletal and cardiac muscle has 
been identified and characterized. ARC functions as an inhibitor of apoptosis and interacts 
selectively with caspases (Koseki, T et al. (1998) Proc. Nad. Acad. Sci. USA 95:5156-5160). AU of 
these interactions have clear effects on the control of apoptosis (reviewed in Chan S.L. and M.P. 
Mattson (1999) J. Neurosci. Res. 58:167-190; Salveson, G.S. and V.M. Dixit (1999) Proc. Natl. 
Acad. Sci. USA 96:10964-10967). 

ESI 8 was identified as a potential regulator of apoptosis in mouse T-cells (Park, E.J. et al. 
(1999) Nuc. Acid. Res. 27:1524-1530). ES18 is 428 amino acids in length, contains an N-terminal 
proline-rich region, an acidic glutamic acid-rich domain, and a putative LXXLL nuclear receptor 
binding motif The protein is preferentially expressed in lymph nodes and thymus. The level of ESI 8 
expression increases in T-cell thymoma S49.1 m response to treatment with dexamethasone, 
staurosporine, or C2-ceramide, which induce apoptosis. ESI 8 may play a role in stimulating 
apoptotic cell death in T-cells. 

The rat ventral prostate (RVP) is a model system for the study of hormone-regulated 
apoptosis. RVP epithelial cells undergo apoptosis in response to androgen deprivation. Messenger 
RNA (mRNA) transcripts that are up-regulated in the apoptotic RVP have been identified (Briehl, M. 
M. and R.L. Miesfeld (1991) Mol. Endocrinol. 5:1381-1388). One such transcript encodes RVP.l, 
the precise role of which in apoptosis has not been determined. The human homolog of RVP. 1 , 
hRVPl, is 89% identical to the rat protein (Katahira, J. et al. (1997) J. Biol. Chem. 272:26652- 
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26658). hRVPl is 220 amino acids in length and contains four transmembrane domains. hRVPl is 
highly expressed in the lung, intestine, and liver. Interestmgly, hRVPl functions as a low affinity 
receptor for the Clostridium perfringens enterotoxin, a causative agent of diarrhea in humans and 
other animals. 

5 Cytokine-mediated apoptosis plays an important role in hematopoiesis and the immune 

response. Myeloid cells, which are the stem cell progenitors of macrophages, neutrophils, 
eiythrocytes, and other blood cells, proliferate in response to specific cytokines such as 
granulocyte/macrophage-colony stimulatmg factor (GM-CSF) and interleukin-3 (rL-3). When 
deprived of GM-CSF or IL-3, myeloid ceUs undergo apoptosis. The murine requiem {req) gene 
10 encodes a putative transcription factor required for this apoptotic response in the myeloid cell line 
FDCP-1 (Gabig. T. G. et al. (1994) J. Biol. Cham. 269:29515-29519). The Req protein is 371 amino 
acids m length and contains a nuclear localization signal, a single Kruppel-typc zinc finger, an acidic 
domain, and a cluster of four unique zinc-finger motifs enriched in cysteine and histidine residues 
involved in metal binding. Expression of req is not myeloid- or apoptosis-specific, suggesting that 
15 additional factors regulate Req activity in myeloid ceU apoptosis. 

Dysregulation of apoptosis has recently been recognized as a significant factor in the 
pathogenesis of many human diseases. For example, excessive cell survival caused by decreased 
apoptosis can contribute to disorders related to cell proUferation and the immune response. Such 
disorders mclude cancer, autoimmune diseases, viral infections, and inflammation, hi contrast, 
excessive ceU death caused by increased apoptosis can lead to degenerative and immunodeficiency 
disorders such as AIDS, neurodegenerative diseases, and myelodysplastic syndromes. (Thompson, 
C.B. (1995) Science 267:1456-1462.) 

Unpaired regulation of apoptosis is also associated with loss of neurons m Alzheimer's 
disease. Alzheimer's disease is a progressive neurodegenerative disorder that is characterized by the 
formation of senile plaques and neurofibrillary tangles containing amyloid beta peptide. These 
plaques are found in Umbic and association cortices of the brain, includmg hippocampus, temporal 
cortices, cingulate cortex, amygdala, nucleus basaUs and locus caeruleus. B-amyloid peptide 
participates m signaling pathways that induce apoptosis and lead to the death of neurons (Kajkowski, 
C. et al. (2001) J. Biol. Chem. 276:18748-18756). Early in Alzheimer's pathology, physiological 
changes are visible in the cingulate cortex (Minoshuna, S. et al. (1997) Annals of Neurology 42:85- 
94). In subjects with advanced Alzheimer's disease, accumulating plaques damage the neuronal 
architecture in limbic areas and eventually cripple the memory process. 
Aging and Senescence 

Studies of the aging process or senescence have shown a number of characteristic ceUular 
and molecular changes (Fauci et al. (1998) Hanison's Principle s of Internal M^^i.w McGraw-Hill, 



20 



25 



21 



wo 03/050253 



PCT/US02/39133 



New York NY, p.37). These characteristics include increases in chromosome structural 
abnormalities, DNA cross-linkuig, incidence of single-stranded breaks in DNA, losses in DNA 
methylation, and degradation of telomere regions. In addition to these DNA changes, post- 
translational alterations of proteins increase including, deamidation, oxidation, cnsss-linking, and 
5 nonenzymatic glycation. Still fiirther molecular changes occur in the mitochondria of aging cells 
through deterioration of structure. These changes eventually contribute to decreased function in 
every organ of the body. 
Cancer 

Understandmg of the neoplastic process can be aided by the identification of molecular 
10 markers of prognostic and diagnostic incportance. Cancers are associated with oncoproteins which 
are capable of transfonning nonnal cells into malignant cells. Some oncoproteins are mutant 
isoforms of the nonnal protein while others are abnormally expressed with respect to location or level 
of expression. Nonnal ceU proliferation begins with binding of a growth factor to its receptor on the 
cell membrane, resulting in activation of a signal system that induces and activates nuclear regulatory 
15 factors to mitiate DNA transcription, subsequently leading to cell division. Classes of oncoprotems 
known to affect the ceU cycle controls include growth factors, growth factor receptors, intracellular 
signal transducers, nuclear transcription factors, and cell-cycle control proteins. Several types of 
cancer-specific genetic markers, such as tumor antigens and tumor suppressors, have also been 
identified. 
20 Oncogenes 

Oncoproteins are encoded by genes, caUed oncogenes, that are derived from genes that 
normally control cell growth and development. Many oncogenes have been identified and 
characteri2Bd. These include growth factors such as sis, receptors such as erbA, erbB, neu. and ros, 
intracellular receptors such as src, yes.fps. abl, and met, protein-serine/threonine kinases such as mos 

25 and raf, nuclear transcription factors such as jwi, fos. myc. N-myc, myb, ski, and rel, cell cycle control 
proteins such as RB and p55. mutated tumor-suppressor genes such as mdm2, dpi, pl6, and cycliti 
D, ras, set, can, sec, and gag RIO. 

Viral oncogenes are integrated into the human genome after infection of human cells by 
certain viruses. Exanq>les of vural oncogenes include v-src, v-abl, and v-fps. Transformation of 

30 normal genes to oncogenes may also occur by chromosomal translocation. The Philadelphia 
chromosome, characteristic of chronic myeloid leukemia and a subset of acute lymtphoblastic 
leukemias, results from a reciprocal translocation between chromosomes 9 and 22 that moves a 
truncated portion of the proto-oncogene c-abl to the breakpoint cluster region (bcr) on chromosome 
22. The hybrid c-abl-bcr gene encodes a chimeric protein that has tyrosine kinase activity. In chronic 

35 myeloid leukemia, the chimeric protein has a molecular weight of 210 kd, whereas in acute leukemias 
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a more active 180 kd tyrosine kinase is formed (Robbins. S.L. et al. (1994) Pathologic Basis nf 
Disease, W.B. Saunders Co., Philadelphia PA). 

The Ras superfamily of small GTPases is involved in the regulation of a wide range of 
cellular signaling pathways. Ras family proteins are membrane-associated proteins acting as 
5 molecular switches that bind GTP and GDP. hydrolyzing GTP to GDP. In the active GTP-bound 
state Ras family proteins interact with a variety of cellular targets to activate downstream signaHng 
pathways. For example, members of the Ras subfamily are essential in transducing signals from 
receptor tyrosine kinases (RTKs) to a series of serine/threonine kinases which control ceU growth and 
differentiation. Activated Ras genes were initiaUy found in hmnan cancers and subsequent studies 
10 confnmed that Ras function is critical in the determination of whether cells continue to grow or 
become temiinaUy differentiated (Barbacid. M. (1987) Annu. Rev. Biochem. 56:779-827; Treisman. 
R. (1994) Curr. Opin. Genet. Dev. 4:96-98). Mutant Ras proteins, which bind but can not hydrolyze 
GTP, are permanenUy activated, and cause continuous cell proUferation or cancer. 

Activation of Ras family proteins is catalyzed by guanine nucleotide exchange factors (GEFs) 
15 which catalyze the dissociation of bound GDP and subsequent binding of GTP. A recenUy 
discovered RalGEF-like protein. RGL3, interacts with both Ras and the related protein Rit. 
Constitutively active Rit. like Ras, can induce oncogenic transformation, although since Rit fails to 
interact with most known Ras effector proteins, novel cellular targets may be involved in Rit 
transfoming activity. RGL3 interacts with both Ras and Rit, and thus may act as a downstream 
20 effector for these proteins (Shao. H. and D.A. Andres (2000) J. Biol. Chem. 275:26914-26924). 
Tumor antif yans 

Tumor antigens are cell surface molecules that are differentially expressed in tmnor cells 
relative to non-tumor tissues. Tumor antigens make tumor cells immunologically distinct from 
normal cells and are potential diagnostics for human cancers. Several monoclonal antibodies have 
25 been identified which react specificaUy with cancerous ceUs such as T-cell acute lymphoblastic 
leukemia and neuroblastoma (Minegishi, M. et al. (1989) Leukemia Res. 13:43-51; Takagi. S. et al. 
(1995) Int. J. Cancer 61:706-715). In addition, the discovery of high level expression of thl HER2 
gene in breast tumors has led to the development of therapeutic treatments (Liu, E. et al. (1992) 
Oncogene 7: 1027-1032; Ken,, J.A. (1993) Am. J. Respir. Cell Mol. Biol. 9:448-454). Tumor 
30 antigens are found on the cell surface and have been characterized either as membrane proteins or 
glycoproteins. For example, MAGE genes encode a family of tumor antigens recognized on 
melanoma cell surfaces by autologous cytolytic T lymphocytes. Among the 12 human MAGE genes 
isolated, half are differentially expressed in tumors of various histological types (De Plaen, E. et al 
(1994) Immunogenetics 40:360-369). None of the 12 MAGE genes, however, is expressed in healdiy 
35 tissues except testis and placenta. 
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Tumor suppressors 

Tumor suppressor genes are generally defined as genetic elements whose loss or inactivation 
contributes to the deregulation of cell proliferation and the pathogenesis and progression of cancer. 
Tumor suppressor genes normally function to control or inhibit cell growth in response to stress and 
5 to limit the proliferative life span of the cell. Several tumor suppressor genes have been identified 
including the genes encoding the retinoblastoma (Rb) protein, p53, and the breast cancer 1 and 2 
proteins (BRCAl and BRCA2). Mutations in these genes are associated with acquired and inherited 
genetic predisposition to the development of certain cancers. 

The role of p53 in the pathogenesis of cancer has been extensively studied. (Reviewed m 

10 Aggarwal, M. L. et al. (1998) J. Biol. Chem. 273:1-4; Levine, A. (1997) CeU 88:323-331.) About 
50% of all human cancers contain noutations in the p53 gene. These mutations result in either the 
absence of functional p53 or, more commonly, a defective form of p53 which is overexpressed. p53 
is a transcription factor that contains a central core domain required for DNA binding. Most cancer- 
associated mutations in p53 localize to diis domain. In normal proliferating cells, p53 is expressed at 

15 low levels and is rapidly degraded. p53 expression and activity is induced in response to DNA 
damage, abortive mitosis, and other stressful stimuli. In these instances, p53 induces apoptosis or 
arrests cell growth until the stress is removed. Downstream effectors of p53 activity include 
apoptosis-specific proteins and cell cycle regulatory proteins, mcluding Rb, oncogene products, 
cyclins, and cell cycle-dependent kinases. 

20 The metastasis-suppressor gene KAIl (CD82) has been reported to be related to the tumor 

suppressor gene p53. KAIl is mvolved in the progression of human prostatic cancer and possibly 
lung and breast cancers when expression is decreased. KAIl encodes a member of a structurally 
distinct family of leukocyte surface glycoproteins. The family is known as either the tetraspan 
transmembrane protein family or transmembrane 4 superfamily (TM4SF) as the members of this 

25 family span the plasma membrane four times. The family is composed of integral membrane protems 
having a N-terminal membrane-anchoring domain which fimctions as both a membrane anchor and a 
translocation signal during protein biosynthesis. The N-tenninal membrane-anchoring domam is not 
cleaved during biosynthesis. TM4SF proteins have three additional transmembrane regions, seven or 
more conserved cysteine residues, are similar in size (218 to 284 residues), and all have a large 

30 extracellular hydrophilic domain with three potential N-glycosylation sites. The promoter region 
contains many putative binding motifs for various transcription factors, including five AP2 sites and 
nine Spl sites. Gene structure comparisons of KAIl and seven other members of the TM4SF indicate 
that the splicing sites relative to the different structural domains of the predicted proteins are 
conserved. This suggests that these genes are related evolutionarily and arose through gene 

35 duplication and divergent evolution (Levy, S. et al. (1991) J. Biol. Chem 266: 14597-14602; Dong, 
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J.T. et al. (1995) Science 268:884-886; Dong, J.T. et al., (1997) Genomics 41:25-32). 

The Leucine-rich gene-Glioma Inactivated (LGIl) protein shares homology with a number of 
transmembrane and extracellular proteins which function as receptors and adhesion proteins. LGIl is 
encoded by an LLR (leucine-rich, repeat-containing) gene and maps to 10q24. LGIl has four LLRs 
5 which are flanked by cysteine-rich regions and one transmembrane domain (Somerville, R.P. et al. 
(2000) Mamm. Genome 1 1:622-627). LGIl expression is seen predominantly in neural tissues, 
especially brain. The loss of tumor suppressor activity is seen in the inactivation of the LGIl protein 
which occurs during the transition from low to high-grade tumors in malignant gUomas. The 
reduction of LGIl expression in low grade brain tumors and its significant reduction or absence of 
10 expression in malignant gliomas suggests that it could be used for diagnosis of glial tumor 
progression (Chemova, O.B. et al. (1998) Oncogene 17:2873-2881). 

The ST13 tumor suppressor was identified in a screen for factors related to colorectal 
carcinomas by subtractive hybridization between cDNA of normal mucosal tissues and naRNA of 
colorectal carcinoma tissues (Cao, J. et al. (1997) J. Cancer Res. Clin. Oncol. 123:447-451). 8713 is 
15 down-regulated in human colorectal carcinomas. 

Mutations in the von Hippel-Lindau (VHL) tumor suppressor gene are associated with retinal 
and central nervous system hemangioblastomas, clear cell renal carcinomas, and pheochromocytomas 
(Hoffman, M. et al. (2001) Huia Mol. Genet. 10:1019-1027; Kamada. M. (2001) Cancer Res. 
6 1 :4 1 84-4 189). Tumor progression is linked to defects or inactivation of the VHL gene. VHL 
20 regulates die expression of transforming growth factor-cc, the GLUT-1 glucose transporter and 

vascular endothelial growth factor. The VHL protem associates with elongin B, elongm C, Cul2 and 
Rbxl to form a con^lex that regulates die transcriptional activator hypoxia-inducible factor (HSF). 
HIF induces genes mvolved in angiogenesis such as vascular endothelial growth factor and platelet- 
derived growth factor B. Loss of control of HIF caused by defects in VHL results in the excessive 
23 production of angiogenic peptides. VHL may play roles in inhibition of angiogenesis, cell cycle 
control, fibronectin matrix assembly, cell adhesion, and proteolysis. 

Mutations m tumor suppressor genes are a common feature of many cancers and often appear 
to affect a critical step in the pathogenesis and progression of tumors. Accordingly, Chang, F. et al. 
(1995; J. Clin. Oncol. 13: 1009-1022) suggest that it may be possible to use either the gene or an 
30 antibody to the expressed protein 1) to screen patients at increased risk for cancer, 2) to aid in 

diagnosis made by traditional methods, and 3) to assess the prognosis of individual cancer patients. 
In addition, Hamada, K. et al. (1996; Cancer Res. 56:3047-3054) are investigating die introduction of 
p53 into cervical cancer cells via an adenoviral vector as an experimental therapy for cervical cancer. 
The PR-domain genes were recently recognized as playing a role in human tumorigenesis. 
35 PR-domain genes normally produce two protein products: the PR-plus product, which contains the PR 
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domain, and the PR-minus product which lacks this domain. In cancer cells, PR-plus is disrupted or 
overexpressed, while PR-minus is present or overexpressed. The im^balance in the amount of these 
two proteins appears to be an important cause of malignancy (Jiang, G.L. and S. Huang (2000) Histol. 
Histopathol. 15:109-117). 

Many neoplastic disorders in humans can be attributed to inappropriate gene transcription. 
Malignant cell growth may result from either excessive expression of tumor promoting genes or 
insufficient expression of tumor suppressor genes (Cleary, M.L. (1992) Cancer Surv. 15:89-104). 
Chromosomal translocations may also produce chimeric loci which fuse the coding sequence of one 
gene with the regulatory regions of a second unrelated gene. An important class of transcriptional 
regulators are the zinc finger proteins. The zinc finger motif, which binds zinc ions, generally 
contains tandem repeats of about 30 amino acids consisting of periodically spaced cysteine and 
histidine residues. Examples of this sequence pattern include the C2H2-type, C4-type, and C3HC4- 
type zinc fmgers, and the PHD domain (Lewm, B. (1990) Genes IV . Oxford University Press, New 
York, NY, and Cell Press, Cambridge, MA, pp. 554-570; Aasland, R., et al. (1995) Trends Biochem. 
Sci. 20:56-59). One clinically relevant zinc-finger protein is WTl, a tumor-suppressor protein that is 
mactivated in children with Wihn's tumor. The oncogene bcl-6, which plays an inq)ortant role in 
large-cell lymphoma, is also a zinc-finger protein (Papavassiliou, A.G. (1995) N. Engl. J. Med. 
332:45-47). 

Tumor responsive proteins 

Cancers, also called neoplasias, can be divided into three categories: carcinomas, sarcomas, 
and leukemias. Carcinomas are malignant growths of soft epithelial cells that may infiltrate 
surrounding tissues and give rise to metastatic tumors. Sarcomas may be of epithelial origin or arise 
from connective tissue. Leukemias are progressive malignancies of blood-forming tissue 
characterized by proliferation of leukocytes and their precursors, and may be classified as 
myelogenous (granulocyte- or monocyte-derived) or lymphocytic (lyn^hocyte-derived). 
Tumorigenesis refers to die progression of a tumor's growth from its mception. Malignant cells may 
be quite similar to normal cells within the tissue of origin or may be undifferentiated (anaplastic). 
Tumor cells may possess few nuclei or one large polymorphic nucleus. Anaplastic cells may grow in 
a disorganized mass that is poorly vascularized and as a result contains large areas of ischemic 
necrosis. Differentiated neoplastic ceUs may secrete the same proteins as the tissue of origin. 
Cancers grow, infiltirate, invade, and destroy the surroundmg tissue through direct seeding of body 
cavities or surfaces, tiffough lymphatic spread, or through hematogenous spread. Cancer remains a 
major public healtii concern and current preventative measures and ti-eahnents do not match the needs 
of most patients. Understanding of the neoplastic process of tumorigenesis can be aided by the 
identification of molecular markers of prognostic and diagnostic importance. 
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Current forms of cancer treatment include the use of immunosuppressive drugs (Morisaki, T. 
et al. (2000) Anticancer Res. 20:3363-3373; Geoerger, B. et al. (2001) Cancer Res. 61: 1527-1532). 
The identification of proteins involved in cell signaling, and specifically proteins that act as receptors 
for immunosuppressant drugs, may facilitate the development of anti-tumor agents. For exanq)le, 
5 immunophilins are a family of conserved proteins found in both prokaryotes and eukaryotes that bind 
to immunosuppressive drugs with varying degrees of specificity. One such group of immunophilic 
proteins is the peptidyl-prolyl cis-trans isomerase (EC 5.2.1.8) family (PPIase, rotamase). These 
enzymes, first isolated from porcine kidney cortex, accelerate protein folding by catalyzing the cis- 
trans isomerization of proline imidic peptide bonds in oligopeptides (Fischer, G. and F.X. Schmid 

10 (1990) Biochemistry 29:2205-2212). Included within the immunophilin family are the cyclophilins 
(e.g., peptidyl-prolyl isomerase A or PPIA) and FK-bmding protein (e.g., FKBP) subfamilies. 
Cyclophilins are multifunctional receptor proteins which participate in signal transduction activities, 
mcluding those mediated by cyclosporin (or cyclosporine). The PPIase domain of each family is 
highly conserved between species. Although structurally distinct, these multifunctional receptor 

15 protems are involved in numerous signal transduction pathways, and have been implicated in folding 
and trafficking events. 

The immunophilin protein cyclophilin binds to the immunosuppressant drug cyclosporin A. 
FKBP, another immunophilin, binds to FK506 (or rapamycin). Rapamycin is an immunosuppressant 
agent that arrests cells in the G, phase of growth, inducmg apoptosis. Like cyclophilin, this macrolide 

20 antibiotic (produced by Streptomyces tsukubaemis) acts by binding to ubiquitous, predominantly 
cytosolic immunophilin receptors. These immunophilin/immunosuppressant complexes (e.g., 
cyclophilin A/cyclosporin A (CypA/CsA) and FKBP12/FK506) achieve their therapeutic results 
through inhibition of the phosphatase calcineurin, a calcium/cahnodulm-dependent protem kinase that 
participates in T-ceU activation (Hamilton, G.S. and J.P. Steiner (1998) J. Med. Chem. 41: 5119- 

25 5143). The murine fkbp51 gene is abundantly expressed in immunological tissues, including the 
thymus and T lymphocytes (Baughman, G. et al. (1995) Molec. Cell. Biol. 15: 4395-4402). 
FKBP12/rapamycin-directed immunosuppression occurs through binding to TOR (yeast) or FRAP 
(FKBP12-rapamycin-associated protein, in mammalian cells), the kinase target of rapamycin essential 
for maintaining normal cellular growth patterns. Dysfunctional TOR signaling has been linked to 

30 various human disorders including cancer (Metcalfe, S.M. et al. (1997) Oncogene 15: 1635-1642; 
Bmami, S. et al. (2001) FASEB J. 15:351-361), and autoimmunity (Damoiseaux, J.G. et al. (1996) 
Transplantation 62:994-1001). 

Several cyclophilin isozymes have been identified, including cyclophilin B, cyclophilin C, 
mitochondrial matrix cyclophilm, bacterial cytosolic and periplasmic PPIases, and natural-killer cell 

35 cyclophilin-related protein possessing a cyclophilin-type PPIase domain, a putative tumor-recognition 
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complex involved in the function of natural killer (NK) cells. These cells participate in the innate 
cellular immune response by lysing virally-infected cells or transformed ceUs. NK cells specifically 
target cells that have lost their expression of major histocompatibility complex (MHC) class I genes 
(common during tumorigenesis), endowing them with the potential for attenuating tumor growth. A 
5 150-kDa molecule has been identified on the surface of human NK ceUs that possesses a domain 
which is highly homologous to cyclophilin/pe ptidyl-prolyl cis-trans isomerase. This cyclophilin- 
type protein may be a component of a putative tumor-recognition complex, a NK tumor recognition 
sequence (NK-TR) (Anderson, S.K. et al. (1993) Proc. Natl. Acad. Sci. USA 90:542-546). The 
NKTR tumor recognition sequence mediates recognition between tumor cells and large granular 
10 lymphocytes (LGLs), a subpopulation of white blood cells (conq>rised of activated cytotoxic T cells 
and natural killer cells) capable of destroying tumor targets. The protein product of the NKTR gene 
presents on the surface of LGLs and facilitates binding to tumor targets. Mote recently, a mouse Nktr 
gene and promoter region have been located on chromosome 9. The gene encodes a NK-cell-specific 
150-kDa protein (NK-TR) that is homologous to cyclophilin and other tumor-responsive proteins 
15 (Sunons-Evelyn, M. et al. (1997) Genomics 40:94-l(M)). 

Other proteins that interact with tumorigenic tissue mclude cytokines such as tumor necrosis 
factor (TNF). The TNF family of cytokines are produced by lyn^hocytes and macrophages, and can 
cause the lysis of transformed (tumor) endothelial cells. Endothelial protem 1 (Edpl) has been 
identified as a human gene activated transcriptionally by TNF-alpha in endothelial cells, and a TNF- 
20 alpha inducible Edpl gene has been identified in the mouse (Swift, S. et al. (1998) Biochim. Biophys. 
Acta 1442:394-398). 
Expression p rofiling 

Microarrays are analytical tools used in bioanalysis. A microarray has a plurality of 
molecules spatially distributed over, and stably associated with, the surface of a solid support. 

25 Microarrays of polypeptides, polynucleotides, and/or antibodies have been developed and find use in 
a variety of applications, such as gene sequencing, monitoring gene expression, gene mapping, 
bacterial identification, drug discovery, and combinatorial chemistry. 

One area in particular in which microairays fmd use is in gene expression analysis. Array 
technology can provide a sunple way to explore the expression of a smgle polymorphic gene or the 

30 expression profile of a large number of related or unrelated genes. When the expression of a single 
gene is examined, arrays are employed to detect the expression of a specific gene or its variants. 
When an expression profile is examined, arrays provide a platform for identifying genes that are 
tissue specific, are affected by a substance being tested in a toxicology assay, are part of a signaling 
cascade, carry out housekeepmg functions, or are specifically related to a particular genetic 

35 predisposition, condition, disease, or disorder. 
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Adipocyte maturation 

The primary function of adipose tissue is the ability to store and release fat during periods of 
feedmg and fasting. White adipose tissue is the major energy reserve in periods of fasting, and its 
reserve is mobilized during energy deprivation. Adipose tissue is one of the primary target tissues for 
insulin, and adipogenesis and insulin resistance are linked in type H diabetes, non-insulin dependent 
diabetes mellitus (MDDM). Cytologically the conversion of a preadipocytes into mature adipocytes 
is characterized by deposition of fat droplets around the nuclei. The conversion process in vivo can 
be induced by thiazolidmediones and other PPARy agonists (Adams et al. (1997) J Clin Invest 
100:3 149-3 153) which also lead to increased sensitivity to insulin and reduced plasma glucose and 
blood pressure. 

Pickup and Crook (1998; Diabetologia 41:1241-8) have suggested that MDDM may result 
from the inabiUty of an individual with hypersensitive acute-phase immune response to carry out 
normal cell siguaUng and repak. Steps in diis process are highly correlated with long-term lifestyle 
and environment and include: 1) high glucose stimulation of insulm and cytokine production, 2) 
influence of various cytokines on tissue remodeling during adipocyte differentiation and their affect 
on signaling pathways, and 3) occurrence of tissue damage when cytokines continue to be produced, 
extracellular matrix components (ECM) are not recycled, and homeostasis is not timely restored. 
Many cytokines and the receptors with which they interact are implicated in this process. These 
cytokines include tumor necrosis factor, connective tissue growth factor, transforming growth factor- 
beta, interleukin (IL)-13 and thek receptors. Tumor necrosis factor contributes to insulin resistance 
by mhibiting insulin-stimulated tyrosine phosphorylation of the insulin receptor. This, in turn, 
prevents the insulin receptor from participating in normal signaling processes (Skohiik and 
Marcusohn (1996) Cytokine Growth Factor Rev 7: 161-173; HotamisUgil (1999) J Mem med 
245:621-625). Connective tissue growth factor mediates the buildup of mesengUal matrix (Murphy et 
al. (2000) J Biol Chem 274:5830-5834). Transforming growth factor-beta mediates the buildup of 
mesenglial matrix of the kidney and affects vascular function through its mteraction with the inositol 
trisphosphate receptor, a key intracellular calcium channel (Shanna and McGowan (2000) Cytokme 
Growth Factor Rev 11:115-123). 

IL-13 and IL-4 are immuno-regulatoiy cytokines which share many overlapping biological 
properties. They both promote growth of B-ceUs (McKenzie et al. (1993) Proc Natl Acad Sci 
90:3735-3739), induce expression of germ line Ce transcripts, and direct naive B cells to switch to the 
synthesis of IgE and IgG4 ( Punnomen et al. (1993) Proc Natl Acad Sci 90:3730-3734). SimUarly, 
different isofonns of the IL-13 and IL-4 receptors interact to form four types of IL-13 receptor 
conq)lexes. In some instances, IL-13 utilizes a receptor complex composed of the IL-4 receptor-a 
chain (Ra) and the ]L-13Ra. Although the specific role of each chain in IL-13 signaling is unclear. 
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Ba/F3 cells transfected with IL-13Ral display a mitogenic response to IL-13, but cells transfected 
with mouse IL-13Ra2 do not. lii addition, a soluble lL-13Ra2/Fc fusion protein blocks the mitogenic 
response to IL-13 (Donaldson et al. (1998) J Immunol 161:2317-2324). This suggests that IL-13Ra2 
could serve as a dominant negative inhibitor or decoy receptor for IL-13. However, in colonic 
5 carcinoma cell Imes, the receptor complex displayed growth inhibition which was associated with 
tyrosine phosphorylation of insulin receptor substrate-1. It is evident that more research is needed to 
establish 1) which isoforms of the receptor complex promote cell growth and which inhibit cell 
growth and 2) whether this varies by cell or tissue type. 

Most adipocyte research has been carried out using mouse cell lines. Recent evidence, 
10 however, indicates that culture conditions which stimulate mouse preadipocyte differentiation are 
different from those which induce human preadipocytes. In addition to the known genetic differences 
between these species, diploid human primary cells respond dififerently than aneuploid mouse cells. 
Breast Cancer 

There are more than 180,000 new cases of breast cancer diagnosed each year, and the 
15 mortality rate for breast cancer approaches 10% of all deaths in females between the ages of 45-54 
(Gish, K. (1999) AWIS Magazine 28:7-10). However the survival rate based on early diagnosis of 
localized breast cancer is extremely high (97%), con^jared with the advanced stage of the disease in 
which the tumor has spread beyond the breast (22%). Current procedures for clinical breast 
examination are lacking in sensitivity and specificity, and efforts are underway to develop 

20 comprehensive gene expression profiles for breast cancer that may be used in conjunction with 

conventional screening methods to inq)rove diagnosis and prognosis of this disease (Perou, CM. et 
al. (2000) Nature 406:747-752). 

Mutations in two genes, BRCAl and BRCA2, are known to greatly predispose a woman to 
breast cancer and may be passed on from parents to children (Gish, supra). However, this type of 

25 hereditary breast cancer accounts for only about 5% to 9% of breast cancers, while the vast majority 
of breast cancer is due to non-inherited mutations that occur La breast epithelial cells. 

The relationship between expression of epidermal growth factor (EGF) and its receptor, 
EGFR, to human m a mm ary carcinoma has been particularly well studied. (See Khazaie, K. et al. 
(1993) Cancer and Metastasis Rev. 12:255-274, and references cited therein for a review of this area.) 

30 Overexpression of EGFR, particularly coupled with down-regulation of the estiogen receptor, is a 
marker of poor prognosis in breast cancer patients. In addition, EGFR expression in breast tumor 
metastases is frequentiy elevated relative to the primary tiunor, suggesting that EGFR is involved in 
tumor progression and metastasis. This is supported by accumulating evidence that EGF has effects 
on cell functions related to metastatic potential, such as cell motility, chemotaxis, secretion and 

35 differentiation. Changes in expression of other members of the erbB receptor family, of which EGFR 
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is one, have also been implicated in breast cancer. The abundance of erbB receptors, such as HER- 
2/neu, HER-3, and HER^, and their ligands in breast cancer points to their functional importance in 
the pathogenesis of the disease, and may therefore provide targets for therapy of the disease (Bacus, 
S. S. et al. (1994) Am. J. Clin. Pathol. 102:S13-S24). Other known markers of breast cancer include a 
human secreted frizzled protein mRNA that is downregulated in breast tumors; the matrix Gla 
protein which is overexpressed is human breast carcinoma cells; Drgl or RTP, a gene whose 
expression is diminished in colon, breast, and prostate tumors; maspin, a tumor suppressor gene 
downregulated in invasive breast carcinomas; and CaN19, a member of the SlOO protein family, all of 
which are down regulated in mammary carcinoma cells relative to normal mammary epidielial cells 
(Zhou, Z. et al. (1998) Int. J. Cancer 78:95-99; Chen, L. et al. (1990) Oncogene 5:1391-1395; Ulrix, 
W. et al (1999) FEBS Lett 455:23-26; Sager, R. et al. (1996) Curr. Top. Microbiol. Immunol. 213:51- 
64; and Lee, S. W. et al. (1992) Proc. Natl. Acad. Sci. USA 89:2504-2508). 

Cell lines derived from human mammary epithelial cells at various stages of breast cancer 
provide a useful model to study the process of malignant transformation and tumor progression as it 
has been shown that these cell lines retain many of the properties of their parental tumors for lengthy 
culture periods (Wistuba, I.I. et al. (1998) Clin. Cancer Res. 4:2931-2938). Such a model is 
particularly useful for comparing phenotypic and molecular characteristics of human mammary 
epithelial cells at various stages of malignant transformation. 
Prostate Cancer 

Prostate cancer is a common malignancy in men over the age of 50, and the incidence 
increases with age. In the US, there are approximately 132,000 newly diagnosed cases of prostate 
cancer and more than 33,000 deaths from the disorder each year. 

Once cancer cells arise in the prostate, they are stimulated by testosterone to a more rapid 
growth. Thus, removal of the testes can indirectly reduce both rapid growth and metastasis of the 
cancer. Over 95 percent of prostatic cancers are adenocarcinomas which originate in the prostatic 
acini. The remaining 5 percent are divided between squamous cell and transitional cell carcinomas, 
both of which arise in the prostatic ducts or other parts of the prostate gland. 

As with most cancers, prostate cancer develops through a multistage progression ultimately 
resulting in an aggressive, metastatic phenotype. The initial step m tumor progression involves the 
hyperproliferation of normal luminal and/or basal epithelial cells that become hyperplastic and evolve 
mto early-stage tumors. The early-stage tumors are localized in the prostate but eventually may 
metastasize, particularly to the bone, brain or lung. About 80% of these tumors remain responsive to 
androgen treatment, an important hormone controlling the growth of prostate epithelial cells. 
However, in its most advanced state, cancer growth becomes androgen-independent and there is 
currently no known treatment for this condition. 
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A primary diagnostic marker for prostate cancer is prostate specific antigen (PSA). PSA is a 
tissue-specific serine protease almost exclusively produced by prostatic epitlielial cells. The quantity 
of PSA correlates with the number and volume of the prostatic epithelial cells, and consequently, the 
levels of PSA are an excellent indicator of abnormal prostate growth. Men with prostate cancer 
exhibit an early linear increase in PSA levels followed by an exponential increase prior to diagnosis. 
However, since PSA levels are also influenced by factors such as inflammation, androgen and other 
growth factors, some scientists maintain that changes in PSA levels are not useftil in detecting 
individual cases of prostate cancer. 

Current areas of cancer research provide additional prospects for markers as well as potential 
therapeutic targets for prostate cancer. Several growth factors have been shown to play a critical role 
in tumor development, growth, and progression. The growth factors Epidermal Growth Factor 
(EOF), Fibroblast Growth Factor (FGF), and Tumor Growth Factor alpha (TGFa) are unportant in the 
growth of normal as well as hyperproliferative prostate epithelial cells, particularly at early stages of 
tumor development and progression, and affect signaling pathways in diese cells in various ways (Lin 
J siaL (1999) Cancer Res. 59:2891-2897; Putz T eLal (1999) Cancer Res 59:227-233). The TGF-p 
family of growth factors are generally expressed at increased levels m human cancers and the high 
expression levels in many cases correlates with advanced stages of malignancy and poor survival 
(Gold LI (1999) Crit Rev Oncog 10:303-360). Finally, there are human cell lines representing both 
the androgen-dependent stage of prostate cancer (LNCap) as well as the androgen-independent, 
hormone refractory stage of the disease (PC3 and DU-145) that have proved useful m studying gene 
expression patterns associated with the progression of prostate cancer, and the effects of cell 
treatments on these expressed genes (Chung ID (1999) Prostate 15:199-207). 
Lung Cancer 

Lung cancer is the leading cause of cancer death for men and the second leading cause of 
cancer death for women in flie U.S. The vast majority of lung cancer cases are attributed to smoking 
tobacco, and increased use of tobacco products in third world countries is projected to lead to an 
epidemic of lung cancer in these countries. Exposure of the bronchial epithelium to tobacco smoke 
appears to result in changes in tissue morphology, which are thought to be precursors of cancer. Lung 
cancers are divided mto four histopathologically distinct groups. Three groups (squamous ceU 
carcinoma, adenocarcinoma, and large cell carcmoma) are classified as non-smaU cell lung cancers 
(NSCLCs). The fourth group of cancers is referred to as small cell lung cancer (SCLC). Collectively, 
NSCLCs account for -70% of cases while SCLCs account for -18% of cases. The molecular and 
cellular biology underlying the development and progression of lung cancer are mcompletely 
understood. 

Deletions on chromosonK 3 are conanon in this disease and are thought to indicate the 
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presence of a tumor suppressor gene in this region. Activating mutations in K-ras are commonly 
found in lung cancer and are the basis of one of the mouse models for the disease. 
Leukemias 

Leukemias can be classified into four major categories, and all involve malignant 
5 transformation of pluripotent stem ceUs. Acute leukemias, both lymphoblastic (ALL) and myeloid 
(AML) types, are characterized by the presence of immature cells in the blood. Chronic leukemias, 
both lymphocytic (CLL) and myelocytic (CML), are associated with mature, differentiated cells, but 
proportions of each cell type are abnormal. For example, CLL patients usually have clonal expansion 
of B ceU lymphocytes. CML patients often have granulocytes of all stages of maturity present in 
10 blood, bone marrow, and other organs. Monoclonal antibodies specific for B- and T-cells are helpful 
diagnostic tools, in addition to histological analysis. Disease progresses as normal hematopoietic 
bone marrow is displaced by malignant cells. Cause has been determined to be genetic in some cases, 
and chemical or radiation-induced in others. 
Genes Repilatftd in Dendritic Cell Differentiation 

Array technology can provide a srniple way to explore the expression of a smgle polymorphic 
gene or the expression profile of a large number of related or unrelated genes. When the expression 
of a single gene is examined, arrays are employed to detect the expression of a specific gene or its 
variants. When an expression profile is examined, arrays provide a platform for examining which 
genes are tissue specific, carrying out housekeeping functions, parts of a signalmg cascade, or 
specifically related to a particular genetic predisposition, condition, disease, or disorder. 

The potential application of gene expression profiling is particularly relevant to 
characterizing lineage differences during cellular development that will improve diagnosis, prognosis, 
and treatment of disease. For exanq)le, both the levels and sequences expressed in dendritic cells 
fi-om subjects with autoimmunity may be compared with the levels and sequences expressed in 
25 dendritic cells from normal subjects. 

Dendritic cells (DC) are antigen presenting ceUs (APC) that play a key role in the primary 
immune response because of their unique ability to present antigens to naive T cells. Jn addition, DC 
differentiate into separate subsets that sustain and regulate immune responses following initial contact 
with antigen. DC subsets include those that preferentially induce particular T helper 1 (Thl) or T 
helper 2 (Th2) responses and those that regulate B ceU responses. Moreover, DC are increasingly 
being used to manipulate immune responses, either to downregulate an aberrant autoimmune response 
or to enhance vaccination or a tumor-specific response. 

DC are fimctionaUy specialized in correlation with their particular differentiation state. 
CD34+ myeloid cells found in the bone marrow mature in response to as yet unclear signals into 
CD14+ CD11C+ monocytes. An innate or antigen non-specific response takes place mitially when 
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monocytes circulate to nonlymphoid tissues and respond to lipopolysaccharide (LPS), a bacterially- 
derived mitogen, and virases. Such direct encounter with antigen causes secretion of pro- 
inflammatory cytokines that attract and regulate natural killer cells, macrophages, and eosinophils in 
the first line of defense against invading pathogens. Monocytes then mature into DC, which capture 
antigen highly efficiently through endocytosis and antigen receptor uptake. Antigen processing and 
presentation trigger activation and differentiation into mature DC that express MHC class II 
molecules on the cell surface and efficiently activate T cells, initiating antigen-specific T cell and B 
cell responses. In turn, T cells activate DC dirough CD40 ligand - CD40 interactions, which stimulate 
expression of the costimulatory molecules CD80 and CD86, the latter most potent m an^Ufying T 
cell responses. DC interaction via CD40 with T cells also stimulates the production of inflammatory 
cytokines such as TNF alpha and IL-1. Engagement of RANK, a member of the TNF receptor family 
by its ligand, TRANCE, which is expressed on activated T cells, enhances the survival of DC through 
inhibition of apoptosis, thereby enhancing T cell activation. The maturation and differentiation of 
monocytes mto mature DC links the antigen non-specific innate immune response to the antigen- 
specific adaptive immune response. 

The process by which monocytes differentiate into unmature dendritic cells in vivo has not 
been fully elucidated. Incubation of monocytes with granulocyte-macrophage colony stimulating 
factor (GM-CSF) and interleukin (IL) -4 m vitro yields cells that exhibit functional and morphological 
characteristics equivalent to immature dendritic cells found in vivo. Moreover, incubation in vitro of 
immature dendritic cells with tumor necrosis factor alpha (TNF-a), CD40 ligand, LPS, or monocyte- 
conditioned medium yields mature dendritic cells that are potent activators of naive T cells. 

The ability to manipulate DC m vitro and theur capacity to mount an effective immune 
response with small numbers of DC and little antigen has led to potential immunotherapies for 
diseases such as cancer, AIDS, and infectious diseases; and enhancing vaccine efficacy. Spontaneous 
remissions of particular cancers such as renal cell carcinomas and melanomas mdicate that the 
immune system can respond to tumor antigens and eliminate tumors. However, tumors escape 
immune surveillance through a number of means including secretion of IL-10, macrophage colony 
stimulating factor, IL-6, and vascular endothelial growth factor, all of wliich mhibit DC activity and 
promote tolerance of tumor tissue. Delivery of tumor antigen-loaded DC to tumors can induce tumor- 
specific rejection in animal models. Similarly, pathogens can escape immune surveillance by altering 
antigen processing and presentation pathways or interfering with maturation of antigen presenting 
cells. Rather than providing resistance, DC can complicate infection by hosting latent viruses such as 
Kaposi's virus and cytomegalovirus, complicating infection. HTV-l and measles virus particles are 
efficiently produced in DC. Vaccines against tumors or infectious pathogens could be improved by 
systemic or local administration of DC loaded with tumor antigens or attenuated viral particles or 
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components, respectively. 

The expression of killer-inliibitor regulatory molecules, chemokines, chemokine receptors, 
and proteinases have been identified in DC through sequencing of ESTs. Continuing this search may 
reveal new lymphocyte-binding and antigen-processing molecules, transmembrane and secretory 
5 products, and transcription factors that may help to explain the specialized features of DC and allow 
manipulation of the immune system. 
Tumor profession 

Endometrial cancer is the most common gynecologic cancer. Approximately 90% of 
endometrial cancers are epithelial in origin, and 90% of these cancers are classified as endometrial 

10 adenocarcinomas. Estrogen appears to act as a tumor promoter in endometrial tissue. Evidence indi- 
cates that p53 and Ki-ras are mutated in endometrial cancer. However, these mutations occur in a 
small percentage of cases and do not appear to be the initiatmg events m the disease. In addition, most 
chromosomes contain regions of aUelic loss in endometrial cancer, indicatmg that many genes may be 
affected in this disease. 

15 Biopharm acological tools for microarrav analysis 

Human umbilical vein endothelial cells (HUVECs) are a primary cell Ime derived from the 
endothelium of the human umbilical vein. HUVECs are used to study the functional biology of 
human endothelial cells in vitro. Activation of vascular endothelium is observed in physiological and 
pathophysiological processes including vascular tone regulation, coagulation and thrombosis, 

20 atherosclerosis, and inflammation. 

Tumor necrosis factor-alpha (TNF-a ) is a pleiotropic cytokine that plays a central role in 
mediation of the inflammatory response through activation of multiple signal transduction pathways. 
TNF-a is produced by activated lymphocytes, macrophages, and other white blood cells, and activates 
endothelial cells. 

25 PMA is an agonist of protein kinase C (PKC). PKC is a calcium-activated, phosphoUpid- 

dependent serine- and threonme-specific kinase that, upon activation, phosphorylates a broad range of 
secondary targets. TNF-a causes translocation of PKC from the cytosol to the membrane where it 
phosphorylates a variety of targets. 

Mterleukin 1 beta (IL-ip)is a cytokine associated with acute mflammatory responses and is 

30 involved in processes such as fever induction, metaboUc regulation, and bone remodeling. Botii cells 
of the unmune system (monocytes, dendritic ceUs, NK cells, platelets, and neutrophils) and somatic 
cells (osteoblasts, neurons, Schwann cells, oligodendrocytes, and adrenal cortical ceUs) can produce 
IL-ip. IL-lp can induce its own production in monocytes, the production of adhesion molecules and 
chemokines in endothelial cells, and interferon gamma (IFN^) production by NK ceUs in conjunction 

35 with lL-12. IL-1 is produced as a single chain pro-molecule that must be cleaved by a specialized 
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protease - E.-1 Converting Enzyme (ICE) - to acquire its function. Interleukin 10 (DL-IO) is 
produced by CD4+ T cell clones and some CD8+ T cell clones. Human B cells, EBV-transformed 
lymphoblastoid cell lines, and monocytes can also produce IL-10 upon activation. IL-10 is a 
pleiotrophic cytokine that can exert either immunostimulatory or immunosupressive eflfects on a 
5 variety of cell types. It is a potent immunosuppressant of macrophage fiinctions. In vitro, IL-10 can 
inhibit the accessory function and antigen-presenting capacity of monocytes by, among other effects, 
downregulating class H MHC expression. Thus, BL-IO can inhibit monocyte/macrophage-dependent, 
antigen specific proliferation of mouse Thl clones as well as human ThO-, Thl-, and Th2- like T 
cells. IL-IO can also inhibit the monocyte/macrophage-dependent, antigen stunulated cytokine 

10 synthesis (especially IFN-g) by human PBMNC and NK. Additionally, IL-10 is a potent inhibitor of 
monocyte/macrophage activation and its resultant cytotoxic effects. It can suppress the production of 
numerous cytokines including TNF-a, IL-1, IL-6, and IL-10, as well as the synthesis of superoxide 
anion, reactive oxygen intermediates, and reactive nitrogen mtermediates by activated monocytes/ 
macrophages. As an immunostimulatory cytokine, IL-10 can act on B cells to enhance thek viabiUty, 

15 cell proliferation, Ig secretion, and class H MHC expression. Aside firom B-lymphocytes, IL-10 is 
also a growth co-stimulator for thymocytes and mast cells, as well as an enhancer of cytotoxic T cell 
development. 

Thiazolidinediones or peroxisome proliferator-activated receptor y (PPAR-y) agonists are a 
new class of antidiabetic agents that improve insulin sensitivity and reduce plasma glucose and blood 
20 pressure in subjects with type H diabetes. These agents can bind and activate an orphan nuclear 
receptor and some of them induce human adipocyte differentiation. 

RNA Expression 

Array technology can provide a simple way to explore the expression of a smgle polymorphic 
gene or the expression profile of a large number of related or umelated genes. When the expression 
25 of a single gene is examined, arrays are employed to detect the expression of a specific gene or its 
variants. When an expression profile is examined, arrays provide a platform for examming which 
genes are tissue specific, carrying out housekeeping ftmctions, parts of a signaUng cascade, or 
specifically related to a particular genetic predisposition, condition, disease, or disorder. 

The potential application of gene expression profiling is particularly relevant to unproving 
30 diagnosis, prognosis, and treatment of disease. For example, both the levels and sequences expressed 
m tissues fiom subjects with a cardiovascular disorder may be con^ared with the levels and 
sequences expressed in normal tissue. 

Atherosclerosis and the associated coronary artery disease and cerebral stroke represent the 
most conomon cause of death in mdustrialized nations. Although certain key risk factors have been 
35 identified, a full molecular characterization that elucidates the causes and provide care for this 
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complex disease has not been achieved. Molecular characterization of growth and regression of 
atherosclerotic vascular lesions requires identification of the genes that contribute to features of the 
lesion including growth, stability, dissolution, rupture and, most lethally, induction of occlusive 
vessel thrombus. Vascular lesions principally involve the vascular endothelium and the surrounding 
5 smooth muscle tissue. 

Development of atherosclerosis is understood to be induced by the presence of circulating 
lipoprotein. Lipoproteins, such as the cholesterol-rich low-density lipoprotein (LDL), accumulate in 
the extracellular space of the vascular intima, and undergo modification. Oxidation of LDL (Ox- 
LDL) occurs most avidly in the sub-endothelial space where circulating antioxidant defenses are less 

10 effective. Mononuclear phagocytes enter the intima, differentiate into macrophages, and ingest 
modified Upids including Ox-LDL. DurioQg Ox-LDL uptake, macrophages produce cytokines (e.g. 
tumor necrosis factor a (TNF-a) and uiterleukm-1 (^-1)) and growth factors (e.g. M-CSF, VEGF, 
and PDGF-BB) that eUcit further cellular events that modulate atherogenesis such as smooth muscle 
cell proliferation and production of extracellular matrix by vascular endothelium. Additionally, these 

15 macrophages may activate genes in endothelium and smooth muscle tissue mvolved in inflammation 
and tissue differentiation, including superoxide dismutatse (SOD), JL-8, and ICAM-1. 

The vascular endothelium influences not only the three classically mteracting components 
of hemostasis: the vessel, the blood platelets and the clotting and fibrinolytic systems of plasma, but 
also the natural sequelae: inflammation and tissue repair. Two principal modes of endothelial 
behavior may be differentiated, best defined as an anti- and a prothrombotic state. Under 
physiological conditions endothelium mediates vascular dilatation (formation of nitric oxide (NO), 
PGI2, adenosine, hyperpolarising factor), prevents platelet adhesion and activation (production of 
adenosine, NO and PGI^, removal of ADP), blocks thrombin formation (tissue factor pathway 
mhibitor. activation of protein C via thrombomodulin, activation of antithrombin HI) and mitigates 
fibrin deposition (t- and scuplasminogen activator production). Adhesion and transmigration of 
inflammatory leukocytes are attenuated, e.g. by NO and IL-10, and oxygen radicals are efficientiy 
scavenged (urate, NO, glutathione, SOD). 

When the endothelium is physically dismpted or functionally perturbed by postischemic 
reperfusion, acute and chronic inflammation, atherosclerosis, diabetes and chronic arterial 
hypertension, then completely opposing actions pertain. This protiurombotic, promflammatory state is 
characterised by vaso-constiiction, platelet and leukocyte activation and adhesion (extemalisation, 
expression and upregulation of, for ex&mple, von Willebrand factor, platelet activating factor, 
P-selectin. ICAM-1, IL-8, MCP-1, and TNF-a), promotion of thrombin formation, coagulation and 
fibrin deposition at the vascular wall (expression of tissue factor, PAI-1, and phosphatidyl serine) and, 
in platelet-leukocyte coaggregates, additional inflammatory interactions via attachment of platelet 
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CD40-ligand to endothelial, monocyte and B-cell CD40. SiBce thrombin formation and inflammatory 
stimulation set the stage for later tissue repair, complete abolition of such endothelial responses 
cannot be the goal of clinical interventions aimed at limitmg procoagulatory, prothromfaotic actions of 
a dysfiinctional vascular endothelium. (See, e.g., Becker et al. (2000) Z Kardiol 89:160-167.) 
5 Tumor necrosis factor a is a pleiotropic cytokine that a mediates immune regnlation and 

inflammatory responses. TOP-a-related cytokines generate partially overlapping ceUular responses, 
including differentiation, proliferation, nuclear factor-KB (NF-kB) activation, and cell death, by 
triggering the aggregation of receptor monomers (Smith, C.A. et al. (1994) Cell 76:959-962). The 
ceUular responses triggered by TNF-a are initiated through its interaction with distinct cell surface 
10 receptors (TNFRs). NF-kB is a transcription factor with a pivotal role in inducing genes involved in 
physiological processes as well as in the response to injury and infection. Activation of NF-kB 
involves the phosphorylation and subsequent degradation of an inhibitory protein, KB.and many of 
the proximal kinases and adaptor molecules involved m this process have been elucidated. 
AdditionaUy, the NF-kB activation pathway from cell membrane to nucleus for IL-1 and TNF-a is 
15 now understood (Bowie and ONeiU (2000) Biochem Pharmacol 59: 13-23). 

Monocyte chemoattractant protein-l (MCP-1) is known to play an important role in the 
pathogenesis of atherosclerosis by inducing monocyte migration. TNF-a treatment of human 
umbilical vein endotheUal cells (HUVECs) increased the cellular secretions of MCP-1 1 19.fold 
compared with untreated cells. Troglitazone, an insuhn-sensitizing drug, significantly inhibited this 
TNF-a-induced increase in MCP-1 secretions and decreased mRNA levels (Ohta et al. (2000) 
Diabetes Res Clin Pract 48:171-176). 

Treatment of confluent cultures of vascular smoodi muscle cells (SMCs) with TNF-a 
suppresses the incorporation of pH]proline into both coUagenase-digestible proteins (CDP) and 
noncoUagenous proteins (NCP). Such suppression by TNF-a is not observed in confluent bovme 
aortic endothelial cells and human fibroblastic IMR-90 cells. TNFhx decreases the relative proportion 
of collagen types IV and V suggesting that TNF-a modulates collagen synthesis by SMCs dependmg 
on their cell density and therefore may modify formation of atherosclerotic lesions (Hiraga et al 
(2000) Life Sci 66:235-244). 

Hmnan coronaiy artery smooth muscle cells (CASMC) are primary ceUs isolated from the 
tunica media (an intermediate muscular layer) of a human coronary artery. Vascular smooth muscle 
ceUs are a model of increasmg significance m vascular biology. It is now well known that besides 
their obvious role in the regulation of vascular tone and, consequently, oxygen supply to various 
tissues, thek behavior mider inflammatory conditions is an important factor m the development of 
atherosclerosis and restenosis. 

Human aortic endotheUal cells (HAECs) are primary cells derived from the endothelium of a 
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human aorta. HAECs have been used as an experimental model for investigating in vitro the role of 
the endothelium in human vascular biology. Activation of the vascular endothelium is considered to 
be a central event in a wide range of both physiological and pathophysiological processes, such as 
vascular tone regulation, coagulation and thrombosis, atherosclerosis, and inflammation. 
5 Thus, vascular tissue genes differentially expressed during treatment of CASMC and HAEC 

cell cultures with TNFa may reasonably be expected to be markers of the atherosclerotic process. 
Steroid Hormones 

Glucocorticoids are naturally occurring hormones that prevent or suppress inflammation and 
immune responses when administered at pharmacological doses. At the molecular level, unbound 

10 glucocorticoids readily cross cell membranes and bind with high affinity to specific cytoplasmic 
receptors. Subsequent to binding, transcription and, ultimately, protem synthesis are affected. The 
result can include mhibition of leukocyte infiltration at the site of inflammation, interference in the 
function of mediators of inflammatory response, and suppression of humoral immune responses. The 
antiinflammatory actions of corticosteroids are thought to involve phosphoUpase A2 inhibitory 

15 proteins, collectively called lipocortins. Lipocortins, in turn, control the biosynthesis of potent 
mediators of inflammation such as prostaglandins and leukotrienes by inhibitmg the release of the 
precursor molecule arachidonic acid. Further, corticosteroids inhibit eosinophil, basophil, and airway 
epithelial cell function by regulation of cytokines that mediate the inflammatory response. They 
inhibit leukocyte infiltration at the site of inflammation, mterfere m the function of mediators of the 

20 inflammatory response, and suppress the humoral immune response. Corticosteroids are used to treat 
allergies, asthma, arthritis, and skm conditions. Beclomethasone is a synthetic glucocorticoid that is 
used to treat steroid-dependent asthma, to relieve symptoms associated with allergic or nonallergic 
(vasomotor) rhinitis, or to prevent recurrent nasal polyps following surgical removal. The anti- 
inflammatory and vasoconstrictive effects of intranasal beclomethasone are 5000 times greater than 

25 those produced by hydrocortisone. 
Colon Cancer 

Colorectal cancer is the second leadmg cause of cancer deaths in the United States. Colon 
cancer is associated with aging, since 90% of the total cases occur in individuals over the age of 55. 
A widely accepted hypothesis is that several contributing genetic mutations must accumulate over 

30 tinae m an individual who develops the disease. To understand the nature of genetic alterations in 
colorectal cancer, a number of studies have focused on the inherited syndromes. The first known 
inherited syndrome. Familial Adenonoatous Polyposis (FAP), is caused by mutations in the 
Adenomatous Polyposis Coli gene (APC), resulting in truncated or inactive forms of the protein. This 
tumor suppressor gene has been mapped to chromosome 5q. Patients with FAP develop numerous 

35 colon polyps at an early age, usually by late adolescence. Colon polyps are believed to be precursors 
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to adenocarcinoma of the colon. If left untreated, almost all carriers of the mutated gene will develop 
colorectal cancer. The mutated gene associated with this syndrome is known as APC (adenomatous 
polyposis coli). The second known inherited syndrome is hereditary nonpolyposis colorectal cancer 
(HNPCC), which is caused by mutations in mismatch repair genes. 
5 Although hereditary colon cancer syndromes occur in a small percentage of the population 

and most colorectal cancers are considered sporadic, knowledge from studies of the hereditary 
syndromes can be generally applied. For instance, somatic mutations in APC occur in at least 80% of 
indiscriminate colon tumors. APC mutations are thought to be the initiating event in the disease. 
Other mutations occur subsequently. Approximately 50% of colorectal cancers contain activating 
10 mutations in ras, while 85% contain inactivating mutations in p53. Changes in these genes lead to 
gene expression changes in colon cancer. Less is understood about downstream targets of these 
mutations and the role they may play in cancer development and progression. 
Parkinson's disease 

Parkinson's disease (PD) is a neurodegenerative disorder characterized by the progressive 
15 degeneration of the dopaminergic nigrostriatal pathway, and the presence of Lewy bodies. Genetic 
linkages to chromosomes 2p4, 4p5, and three loci on lq6-8 have been identified (Gwinn-Hardy K. 
(2002) Mov. Disord. 17:645-656). Qinical disorders classified as parkinsonism include PD, dementia 
with Lewy bodies (DLB), progressive supranuclear palsy (PSP), and essential tremor. Several 
neurodegenerative diseases share share pathogenic mechanisms involving tau or synuclein 

20 aggregation. These disorders mclude Alzheimer's disease, and Pick's disease as well as PD and 

progressive supranuclear palsy (Hardy, J. (2001) J. Alzhehners Dis. 3:109-116). Several genetically 
distinct forms of PD can be caused by mutations in single genes. Genes for monogenically inherited 
forms of Parkinson's disease have been mapped and/or cloned. Li sohk families with autosomal 
dominant inheritance and typical Lewy-body pathology, mutations have been identified m the gene 

25 for alpha-synuclein. Aggregation of this protem in Lewy-bodies may be a crucial step in the 

molecular pathogenesis of familial and sporadic PD. On the other hand, mutations in the parkin gene 
cause early-onset autosomal recessive parkinsonism in which nigral degeneration is not accompanied 
by Lewy-body formation. Parkm-mutations appear to be a common cause of PD in patients with very 
early onset. Parkin has been implicated in the cellular protein degradation pathways, as it has been 

30 shown that it functions as a ubiquitin ligase. A mutation in the gene for ubiquitin C-terminal 

hydrolase Llin this pathway has been identified in another small family with PD. Other loci have 
been mapped to chromosome 2p and 4p, respectively, in families with dominantly mherited PD. 
These early-onset forms differ from the common sporadic form of PD. It is widely believed that a 
combination of interacting genetic and environmental causes may be responsible in the majority of 

35 PD-cases (Gasser, T. (2001) J. Neurol. 2001 248:833-840). 
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There is a need in the art for new compositions, including nucleic acids and proteins, for the 
diagnosis, prevention, and treatment of ceU proliferative disorders including cancer, developmental 
disorders, neurological disorders, autoimmune/inflammatory disorders, reproductive disorders, and 
disorders of the placenta. 



SUMMARY OF THE INVENTION 

Various embodiments of the invention provide purified polypeptides, proteins associated with 
cell growth, differentiation, and death, referred to coUectively as 'CGDD' and individually as 
'CGDD-1,' 'CGDD-2,' 'CGDD-3,' 'CGDD-4,' 'CGDD-5,' 'CGDD-6,' 'CGDD-7,' 'CGDD-8,' 
'CGDD-9,' 'CGDD-10,' 'CGDD-11,' 'CGDD-12,' 'CGDD-13,' 'CGDD-14,' 'CGDD-15,' 'CGDD- 
16,' 'CGDD-17,' 'CGDD-18,' 'CGDD-19,' 'CGDD-20,' 'CGDD-21,' 'CGDD-22,' 'CGDD-23,' 
'CGDD-24,' 'CGDD-25,' 'CGDD-26,' 'CGDD-27,' 'CGDD-28,' 'CGDD-29,' 'CGDD-30,' 'CGDD- 
31,' 'CGDD-32,' and 'CGDD-33' and methods for using these proteins and their encoding 
polynucleotides for the detection, diagnosis, and treatment of diseases and medical conditions. 
Embodunents also provide methods for utiUzing the purified proteins associated with ceU growth, 
differentiation, and death and/or their encoding polynucleotides for facilitating the drug discovery 
process, including determination of efficacy, dosage, toxicity, and pharmacology. Related 
embodiments provide methods for utilizing the purified proteins associated with cell growth, 
differentiation, and death and/or then encoding polynucleotides for mvestigating the pathogenesis of 
diseases and medical conditions. 

An embodiment provides an isolated polypeptide selected from the group consisting of a) a 
polypeptide comprismg an amino acid sequence selected from the group consisting of SEQ ID NO: 1- 
33, b) a polypeptide comprising a naturally occurring amino acid sequence at least 90% identical or at 
least about 90% identical to an amino acid sequence selected from the group consisting of SEQ ID 
NO: 1-33, c) a biologically active fragment of a polypeptide having an amino acid sequence selected 
from the group consistmg of SEQ ID NO: 1-33, and d) an immunogenic fragment of a polypeptide 
having an amino acid sequence selected from the group consisting of SEQ ID NO: 1-33. Another 
embodiment provides an isolated polypeptide comprismg an amino acid sequence of SEQ ID 
NO: 1-33. 

Stni another embodiment provides an isolated polynucleotide encoding a polypeptide selected 
from the group consisting of a) a polypeptide comprising an amino acid sequence selected from the 
group consistmg of SEQ ID NO:l-33, b) a polypeptide comprising a naturally occurring amino acid 
sequence at least 90% identical or at least about 90% identical to an amino acid sequence selected 
from the group consisting of SEQ ID NO:l-33, c) a biologicaUy active fragment of a polypeptide 
having an amino acid sequence selected from the group consisting of SEQ ID NO: 1-33, and d) an 
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immunogenic fragment of a polypeptide having an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-33. In another embodiment, the polynucleotide encodes a polypeptide 
selected from the group consisting of SEQ ID NO: 1-33. In an alternative embodiment, the 
polynucleotide is selected firom the group consisting of SEQ ID NO:34-66. 

Still another embodiment provides a recombinant polynucleotide comprising a promoter 
sequence operably linked to a polynucleotide encoding a polypeptide selected from the group 
consisting of a) a polypeptide comprising an amino acid sequence selected from the group consisting 
of SEQ ID NO: 1-33, b) a polypeptide comprising a naturally occurring amino acid sequence at least 
90% identical or at least about 90% identical to an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-33, c) a biologically active fragment of a polypeptide having an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-33, and d) an immunogenic 
fragment of a polypeptide having an amino acid sequence selected from the group consisting of SEQ 
ID NO: 1-33. Another embodiment provides a cell transformed with the recombinant polynucleotide. 
Yet another embodiment provides a ttansgenic organism comprising the recombinant polynucleotide. 

Another embodiment provides a method for producing a polypeptide selected from the group 
consisting of a) a polypeptide comprising an amino acid sequence selected from the group consisting 
of SEQ ID NO: 1 -33, b) a polypeptide comprising a naturally occurring amino acid sequence at least 
90% identical or at least about 90% identical to an ammo acid sequence selected from the group 
consisting of SEQ ID NO: 1-33, c) a biologically active fragment of a polypeptide having an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-33, and d) an immunogenic 
fragment of a polypeptide having an amino acid sequence selected from the group consisting of SEQ 
ID NO: 1-33. The method comprises a) culturing a cell under conditions suitable for expression of the 
polypeptide, wherein said cell is transformed with a recombinant polynucleotide comprising a 
promoter sequence operably linked to a polynucleotide encoding the polypeptide, and b) recovering 
the polypeptide so expressed. 

Yet another embodiment provides an isolated antibody which specifically bmds to a 
polypeptide selected from the group consisting of a) a polypeptide comprising an amino acid 
sequence selected from the group consisting of SEQ ID NO: 1-33, b) a polypeptide comprising a 
naturally occurring amino acid sequence at least 90% identical or at least about 90% identical to an 
amino acid sequence selected from the group consisting of SEQ ID NO: 1-33, c) a biologically active 
fragment of a polypeptide having an amino acid sequence selected from the group consisting of SEQ 
ID NO: 1-33, and d) an immunogenic fragment of a polypeptide having an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-33. 

Still yet another embodiment provides an isolated polynucleotide selected from the group 
consisting of a) a polynucleotide comprising a polynucleotide sequence selected from the group 
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consisting of SEQ ED NO:34-66. b) a polynucleotide comprising a naturally occurring polynucleotide 
sequence at least 90% identical or at least about 90% identical to a polynucleotide sequence selected 
from the group consisting of SEQ ID NO:34-66, c) a polynucleotide complementary to the 
polynucleotide of a), d) a polynucleotide complementary to the polynucleotide of b). and e) an RNA 
equivalent of a)-d). In other embodiments, the polynucleotide cm comprise at least about 20, 30. 40, 
60, 80, or 100 contiguous nucleotides. 

Yet another embodiment provides a method for detecting a target polynucleotide in a sample, 
said target polynucleotide being selected from the group consisting of a) a polynucleotide comprising 
a polynucleotide sequence selected from the group consisting of SEQ ID NO:34-66, b) a 
polynucleotide comprising a naturally occurring polynucleotide sequence at least 90% identical or at 
least about 90% identical to a polynucleotide sequence selected from the group consisting of SEQ ID 
NO:34-66, c) a polynucleotide complementary to the polynucleotide of a), d) a polynucleotide 
complementary to the polynucleotide of b), and e) an RNA equivalent of a)-d). The method 
comprises a) hybridizing the sample with a probe comprising at least 20 contiguous nucleotides 
15 comprising a sequence complementary to said target polynucleotide in the sample, and which probe 
specifically hybridizes to said target polynucleotide, under conditions whereby a hybridization 
complex is formed between said probe and said target polynucleotide or fragments thereof, and b) 
detecting the presence or absence of said hybridization complex. In a related embodiment, the 
method can include detecting the amount of the hybridization complex, hi still other embodiments, 
20 the probe can comprise at least about 20, 30, 40, 60, 80, or 100 contiguous nucleotides. 

Still yet another embodiment provides a method for detecting a target polynucleotide m a 
sample, said target polynucleotide being selected from the group consisting of a) a polynucleotide 
comprising a polynucleotide sequence selected from the group consistmg of SEQ ID NO:34-66, b) a 
polynucleotide comprismg a naturally occurring polynucleotide sequence at least 90% identical or at 
25 least about 90% identical to a polynucleotide sequence selected from the group consisting of SEQ ID 
NO:34-66, c) a polynucleotide complementary to the polynucleotide of a), d) a polynucleotide 
complementary to the polynucleotide of b), and e) an RNA equivalent of a)-d). The method 
comprises a) amplifying said target polynucleotide or fragment thereof usmg polymerase chain 
reaction amplification, and b) detecting the presence or absence of said ampUfied target 
30 polynucleotide or fragment thereof. In a related embodiment, the method can include detecting the 
amount of the amplified target polynucleotide or fragment thereof. 

Another embodiment provides a composition comprising an effective amount of a 
polypeptide selected from the group consisting of a) a polypeptide comprising an amino acid 
sequence selected from the group consisting of SEQ ID NO: 1-33, b) a polypeptide comprismg a 
35 naturally occurring amino acid sequence at least 90% identical or at least about 90% identical to an 
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amino acid sequence selected from the group consisting of SEQ ID NO: 1-33, c) a biologically active 
fragment of a polypeptide having an amino acid sequence selected from the group consisting of SEQ 
ID NO: 1-33, and d) an inuminogenic fragment of a polypeptide having an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-33, and a pharmaceutically acceptable excipient. 
In one embodiment, the composition can comprise an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-33. Other embodiments provide a method of treating a disease or 
condition associated with decreased or abnormal expression of functional CGDD, comprising 
administering to a patient in need of such freatment the composition. 

Yet another embodiment provides a method for screening a compound for effectiveness as an 
agonist of a polypeptide selected from the group consisting of a) a polypeptide comprising an amino 
acid sequence selected from the group consisting of SEQ ED NO: 1-33, b) a polypeptide conqjrising a 
naturally occurring amino acid sequence at least 90% identical or at least about 90% identical to an 
amino acid sequence selected from the group consisting of SEQ ID NO: 1-33, c) a biologically active 
fragment of a polypeptide having an amino acid sequence selected from the group consisting of SEQ 
ID NO: 1-33, and d) an immunogenic fragment of a polypeptide having an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1 -33. The method comprises a) exposing a sanq)le 
comprising the polypeptide to a compound, and b) detecting agonist activity in the sample. Another 
embodiment provides a composition comprising an agonist compound identified by the method and a 
pharmaceutically acceptable excipient. Yet another embodiment provides a method of treating a 
disease or condition associated with decreased expression of functional CGDD, comprising 
administering to a patient in need of such treatment the composition. 

Still yet another embodiment provides a method for screening a compound for effectiveness 
as an antagonist of a polypeptide selected from the group consisting of a) a pol3^eptide comprising an 
amino acid sequence selected from the group consisting of SEQ ID NO: 1-33, b) a polypeptide 
comprising a naturally occurring amino acid sequence at least 90% identical or at least about 90% 
identical to an amino acid sequence selected from the group consisting of SEQ ID NO: 1-33, c) a 
biologically active fragment of a polypeptide having an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-33, and d) an immunogenic fragment of a polypeptide having an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-33. The method comprises a) 
exposing a sample comprising the polypeptide to a compound, and b) detecting antagonist activity in 
the sample. Another embodiment provides a composition comprising an antagonist compound 
identified by the method and a pharmaceutically acceptable excipient. Yet another embodiment 
provides a method of freating a disease or condition associated with overexpression of fiinctional 
CGDD, comprising administering to a patient in need of such treatment the composition. 

Another embodiment provides a method of screening for a compound that specifically binds 
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to a polypeptide selected from the group consisting of a) a polypeptide comprising an amino acid 
sequence selected from the group consisting of SEQ ID NO: 1-33, b) a polypeptide comprising a 
naturally occurring amino acid sequence at least 90% identical or at least about 90% identical to an 
amino acid sequence selected from the group consisting of SEQ ID NO:l-33, c) a biologically active 
5 fragment of a polypeptide having an amino acid sequence selected from the group consisting of SEQ 
ID NO: 1-33, and d) an inomunogenic fragment of a polypeptide having an amino acid sequence 
selected from the group consistmg of SEQ ID NO: 1-33. The method comprises a) combining the 
polypeptide with at least one test compound under suitable conditions, and b) detecting binding of the 
polypeptide to the test compound, thereby identifying a conq)ound that specifically binds to the 
10 polypeptide. 

Yet another embodiment provides a method of screening for a compound that modulates the 
activity of a polypeptide selected from the group consisting of a) a polypeptide comprising an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-33, b) a polypeptide comprising a 
naturally occurring amino acid sequence at least 90% identical or at least about 90% identical to an 

15 amino acid sequence selected from the group consisting of SEQ ID NO: 1-33, c) a biologically active 
fragment of a polypeptide having an amino acid sequence selected from the group consisting of SEQ 
ID NO: 1-33, and d) an immunogenic fragment of a polypeptide having an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-33. The method comprises a) combining the 
polypeptide with at least one test compound under conditions permissive for the activity of the 

20 polypeptide, b) assessing the activity of the polypeptide in the presence of the test compound, and c) 
conq)aring the activity of the polypeptide in the presence of the test compound with the activity of the 
polypeptide in the absence of the test compound, wherein a change in the activity of the polypeptide 
in the presence of the test compound is indicative of a compound that modulates the activity of the 
polypeptide. 

25 Still yet another embodiment provides a method for screening a compound for effectiveness 

in altering expression of a target polynucleotide, wherein said target polynucleotide comprises a 
polynucleotide sequence selected from the group consisting of SEQ ID NO:34-66, the method 
comprising a) exposing a sample comprising the target polynucleotide to a compound, b) detecting 
altered expression of the target polynucleotide, and c) comparing the expression of the target 

30 polynucleotide in the presence of varying amounts of the conqraund and in the absence of the 
compound. 

Another embodiment provides a method for assessing toxicity of a test compound, said 
method comprising a) treating a biological sample containing nucleic acids with the test compound; 
b) hybridizing the nucleic acids of the treated biological sanqjle with a probe comprising at least 20 
35 contiguous nucleotides of a polynucleotide selected from the group consisting of i) a polynucleotide 
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comprising a polynucleotide sequence selected from the group consisting of SEQ E) NO:34-66, ii) a 
polynucleotide comprising a naturally occurring polynucleotide sequence at least 90% identical or at 
least about 90% identical to a polynucleotide sequence selected from the group consisting of SEQ K> 
NO: 34-66, iii) a polynucleotide having a sequence complementary to i), iv) a polynucleotide 
5 complementary to the polynucleotide of ii), and v) an RNA equivalent of i)-iv). Hybridization occurs 
under conditions whereby a specific hybridization complex is formed between said probe and a target 
polynucleotide in the biological sample, said target polynucleotide selected from the group consisting 
of i) a polynucleotide comprising a polynucleotide sequence selected from the group consisting of 
SEQ ID NO:34-66, ii) a polynucleotide comprising a naturally occurring polynucleotide sequence at 

10 least 90% identical or at least about 90% identical to a polynucleotide sequence selected from the 
group consisting of SEQ ID NO:34-66, iii) a polynucleotide complementary to the polynucleotide of 
i), iv) a polynucleotide complementary to the polynucleotide of ii), and v) an RNA equivalent of i)- 
iv). Alternatively, the target polynucleotide can comprise a fragment of a polynucleotide selected 
from the group consisting of i)-v) above; c) quantifying the amount of hybridization complex; and d) 

15 comparing the amount of hybridization complex in the treated biological sample with the amount of 
hybridization complex in an untreated biological sample, wherein a difference in the amount of 
hybridization complex in the treated biological sanq)le is indicative of toxicity of the test compound. 

BRIEF DESCRIPTION OF THE TABLES 
20 Table 1 summarizes the nomenclature for full length polynucleotide and polypeptide 

embodiments of the invention. 

Table 2 shows the GenBank identification number and aimotation of the nearest GenBank 
homolog, and die PROTEOME database identification numbers and annotations of PROTEOME 
database homologs, for polypeptide embodiments of the invention. The probability scores for the 
25 matches between each polypeptide and its homolog(s) are also shown. 

Table 3 shows structural features of polypeptide embodirr^nts, mcluding predicted motifs and 
domains, along with the methods, algorithms, and searchable databases used for analysis of the 
polypeptides. 

Table 4 lists the cDNA and/or genomic DNA fragments which were used to assemble 
30 polynucleotide embodiments, along with selected fragments of the polynucleotides. 

Table 5 shows representative cDNA libraries for polynucleotide embodiments. 

Table 6 provides an appendix which describes the tissues and vectors used for construction of 
the cDNA Ubraries shown in Table 5. 

Table 7 shows the tools, programs, and algorithms used to analyze polynucleotides and 
35 polypeptides, along with applicable descriptions, references, and threshold parameters. 
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Table 8 shows single nucleotide polymorphisms found in polynucleotide sequences of the 
invention, along with allele frequencies in different human populations. 

DESCRIPTION OF THE INVENTION 

Before the present proteins, nucleic acids, and methods are described, it is understood that 
embodiments of the uivention are not limited to the particular machines, instruments, materials, and 
methods described, as these may vary. It is also to be understood that the tenninology used herein is 
for the purpose of describing particular embodiments only, and is not intended to limit the scope of 
the invention. 

As used herein and in the appended claims, the singular forms "a," "an," and "the" include 
plural reference unless the context clearly dictates otherwise. Thus, for example, a reference to "a 
host cell" includes a plurality of such host ceUs, and a reference to "an antibody" is a reference to one 
or more antibodies and equivalents thereof known to those skilled in the art, and so forth. 

Unless defined otherwise, all technical and scientific terms used herein have the same 
meanmgs as commonly understood by one of ordinary skill in the art to which this invention belongs. 
Although any machines, materials, and methods similar or equivalent to tiiose described herein can be 
used to practice or test the present invention, the preferred machines, materials and methods are now 
described. All publications mentioned herein are cited for the purpose of describing and disclosing 
the cell lines, protocols, reagents and vectors which are reported m the publications and which might 
be used in connection with various embodiments of the invention. Nothing herein is to be construed 
as an admission that the invention is not entitled to antedate such disclosure by virtue of prior 
invention. 
DEFINITIONS 

"CGDD" refers to the amino acid sequences of substantially purified CGDD obtained from 
any species, particularly a mammalian species, including bovine, ovine, porcine, murine, equine, and 
human, and from any source, whether natural, synthetic, semi-synthetic, or recombinant. 

The term "agonist" refers to a molecule which intensifies or mimics the biological activity of 
CGDD. Agonists may include proteins, nucleic acids, carbohydrates, small molecules, or any other 
compound or composition which modulates the activity of CGDD either by directly interacting with 
CGDD or by acting on components of the biological pathway m which CGDD participates. 

An "allelic variant" is an alternative form of the gene encoding CGDD. Allehc variants may 
result from at least one mutation in the nucleic acid sequence and may result in altered mRNAs or in 
polypeptides whose structure or function may or may not be altered. A gene may have none, one, or 
many allelic variants of its naturaUy occurring form. Common mutational changes which give rise to 
allelic variants are generally ascribed to natural deletions, additions, or substitutions of nucleotides. 



47 



wo 03/050253 



PCT/US02/39133 



Each of these types of changes may occur alone, or in combination with the othere. one or more times 

in a given sequence. 

"Altered" nucleic acid sequences encoding CGDD include those sequences with deletions, 
insertions, or substitutions of different nucleotides, resulting in a polypeptide the same as CGDD or a 
5 polypeptide with at least one functional characteristic of CGDD. Included within this defoiition are 
polymorphisms which may or may not be readily detectable using a particular oligonucleotide probe 
of the polynucleotide encoding CGDD, and improper or unexpected hybridization to allelic variants, 
with a locus other than die normal chromosomal locus for the polynucleotide encoding CGDD. The 
encoded protein may also be "altered," and may contain deletions, insertions, or substitutions of 

10 amino acid residues which produce a silent change and result in a functionally equivalent CGDD. 
Deliberate amino acid substitutions may be made on the basis of one or more similarities in polarity, 
charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic nature of the residues, as 
long as tlie biological or immunological activity of CGDD is retained. For example, negatively 
charged amino acids may include aspartic acid and glutamic acid, and positively charged amino acids 

15 may include lysine and arginine. Amino acids with uncharged polar side chains having similar 

hydrophilicity values may include: asparagme and glutamine; and serine and threonine. Amino acids 
with uncharged side chains having similar hydrophilicity values may include: leucine, isoleucine, and 
valine; glycine and alanine; and phenylalanine and tyrosine. 

The terms "amino acid" and "amino acid sequence" can refer to an oligopeptide, a peptide, a 

20 polypeptide, or a protein sequence, or a fragment of any of these, and to natiirally occurring or 
synthetic molecules. Where "amino acid sequence" is recited to refer to a sequence of a naturally 
occurring protein molecule, "amino acid sequence" and like terms are not meant to limit die amino 
acid sequence to the con^)lete native amino acid sequence associated with the recited protein 
molecule. 

25 "An^lification" relates to the production of additional copies of a nucleic acid. 

An5)lification may be carried out using polymerase chain reaction (PCR) technologies or otiier 
nucleic acid amplification technologies well knovra in the art. 

The term "antagonist" refers to a molecule which inhibits or attenuates the biological activity 
of CGDD. Antagonists may include proteins such as antibodies, anticalins, nucleic acids, 
carbohydrates, small molecules, or any other con5)ound or composition which modulates the activity 
of CGDD either by durecUy interacting with CGDD or by acting on components of the biological 
pathway m which CGDD participates. 

The term "antibody" refers to intact immunoglobulin molecules as well as to fragments 
thereof, such as Fab, F(ab')2. and Fv fragments, which are capable of bmdmg an epitopic determinant. 
Antibodies that bind CGDD polypeptides can be prepared using intact polypeptides or using 
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fragments containing small peptides of interest as the immunizing antigen. The polypeptide or 
oligopeptide used to immunize an animal (e.g., a mouse, a rat, or a rabbit) can be derived from the 
translation of RNA, or synthesized chemically, and can be conjugated to a carrier protein if desired. 
Commonly used carriers that are chemically coupled to peptides include bovine serum albumin, 
thyroglobulin, and keyhole limpet hemocyanin (KLH). The coupled peptide is then used to immunize 
the animal. 

The term "antigenic determinant" refers to that region of a molecule (i.e., an epitope) that 
makes contact with a particular antibody. When a protein or a fragment of a protein is used to 
immunize a host animal, numerous regions of the protein may induce the production of antibodies 
which bmd specifically to antigenic determinants (particular regions or three-dimensional structures 
on the protein). An antigenic determinant may compete with the intact antigen (i.e., the immunogen 
used to elicit the immune response) for binding to an antibody. 

The term "aptamer" refers to a nucleic acid or oligonucleotide molecule that binds to a 
specific molecular target Aptamers are derived from an in vitro evolutionary process (e.g., SELEX 
(Systematic Evolution of Ligands by Exponential Enrichment), described in U.S. Patent No. 
5,270,163), which selects for target-specific aptamer sequences firom large combinatorial libraries. 
Aptamer compositions may be double-stranded or single-stranded, and may include 
deoxyribonucleotides, ribonucleotides, nucleotide derivatives, or other nucleotide-like molecules. 
The nucleotide components of an aptamer may have modified sugar groups (e.g., the 2'-0H group of a 
ribonucleotide may be replaced by 2'-F or 2'-NH2), which may improve a desked property, e.g., 
resistance to nucleases or longer lifetime in blood. Aptamers may be conjugated to other molecules, 
e.g., a high molecular weight carrier to slow clearance of the aptamer from the circulatory system. 
Aptamers may be specifically cross-linked to their cognate ligands, e.g., by photo-activation of a 
cross-linker (Brody, E.N. and L. Gold (2000) J. Biotechnol. 74:5-13). 

The term "intramer" refers to an aptamer which is expressed in vivo. For example, a vaccinia 
virus-based RNA expression system has been used to express specific RNA aptamers at high levels in 
the cytoplasm of leukocytes (Blind, M. et al. (1999) Proc. Natl. Acad. Sci. USA 96:3606-3610). 

The term "spiegelmer" refers to an aptamer which includes L-DNA, L-RNA, or other left- 
handed nucleotide derivatives or nucleotide-like molecules. Aptamers containing left-handed 
nucleotides are resistant to degradation by naturally occurring enzymes, which normally act on 
substrates containing right-handed nucleotides. 

The term "antisense" refers to any composition capable of base-pairing with the "sense" 
(codmg) strand of a polynucleotide having a specific nucleic acid sequence. Antisense compositions 
may include DNA; RNA; peptide nucleic acid (PNA); oligonucleotides havmg modified backbone 
linkages such as phosphorothioates, methylphosphonates, or benzylphosphonates; oligonucleotides 



49 



wo 03/050253 PCT/US02/39133 
having modified sugar groups such as 2'-xnethoxyethyl sugars or 2'-methoxyethoxy sugars- or 
ohgonucleotides having modified bases such as 5-methyl cytosine, 2--deoxyuracil, or 7-deaza-2'- 
deoxyguanosine. Antisense molecules n,ay be produced by any method including chemical synthesis 
or transcription. Once introduced into a cell, the complementary antisense molecule base-pairs with a 
5 naturally occurring nucleic acid sequence produced by the cell to form duplexes which block either 
transcription or translation. The designation "negative" or "minus" can refer to the antisense strand 
and the designation "positive" or "plus" can refer to the sense strand of a reference DNA molecule ' 

The term "biologically active" refers to a protein having structural, regulatory, or biochemical 
functions of a naturally occmring molecule. Likewise, "immunologically active" or "immunogenic" 
10 refers to the capability of the natural, recombinant, or synthetic CGDD, or of any oligopeptide 

thereof, to induce a specific immune response in appropriate animals or ceUs and to bind with specific 
antibodies. 

"Complementary" describes the relationship between two smgle-stranded nucleic acid 
sequences that amieal by base-pairing. For example. 5--AGT-3' pairs with its conq,lement. 
15 3'-TCA-5'. 

A "composition comprising a given polynucleotide" and a "composition comprising a given 
polypeptide" can refer to any composition containing the given polynucleotide or polypeptide The 
con^osition may con^rise a dry fonnulation or an aqueous solution. Compositions comprising 
polynucleotides encoding CGDD or fragments of CGDD may be employed as hybridization probes 
20 Tt. probes may be stored in freeze-dried form and may be associated with a stabilizing age.t such as 
a carbohyd^te. m hybridizations, the probe may be deployed in an aqueous solution containing salts 
(e.g.. NaCI), detergents (e.g.. sodium dodecyl sulfate; SDS), and other components (e.g.. Denhardt's 
solution, dry milk, salmon sperm DNA, etc.). 

"Consensus sequence" refers to a nucleic acid sequence which has been subjected to repeated 
DNA sequence analysis to resolve uncalled bases, extended using the XL-PCR kit (AppUed 
Biosystems. Foster City CA) in the 5' and/or the 3' direction, and ^sequenced, or which has been 
assembled from one or more overlapping cDNA. EST. or genomic DNA fragments using a computer 
program for fragment assembly, such as the GELVIEW fragment assembly system (Accelrys 
Burlington MA) or Phxap (University of Washington. Seattle WA). Some sequences have beln both 
extended and assembled to produce the consensus sequence. 

"Conservative amino acid substitutions" are those substitutions that are predicted to least 
interfere with the properties of the original protein. i.e., the structure and especiaUy the function of 
the protein is conserved and not significantly changed by such substitutions. The table below shows 
amino acids which may be substituted for an original amino acid in a protein and which are regarded 
35 as c(Hiservative ammo acid substitutions. 
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Original Residue 



10 



15 



20 



Ala 
Arg 
Asn 
Asp 
Cys 
Gin 
GIu 
Gly 
His 
He 
Leu 
Lys 
Met 
Phe 
Ser 
Thr 
Trp 
Tyr 
Val 



Conservative Substitution 



Gly, Ser 
His, Lys 
Asp, Gin, His 
Asn, Glu 
Ala, Ser 
Asn, Glu, His 
Asp, Gin, His 
Ala 

Asn, Arg, Gin, Glu 
Leu, Val 
He, Val 
Arg, Gin. Glu 
Leu, lie 

His, Met, Leu, Trp, Tyr 

Cys, Thr 

Ser, Val 

Phe, Tyr 

His, Phe, Trp 

Be, Leu. Thr 
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Conservative atnino acid substitutions generally maintain (a) the structure of the polypeptide 
backbone in the area of the substitution, for example, as a beta sheet or alpha helical confonnation 
(b) the charge or hydrophobicity of the molecule at the site of the substitution, and/or (c) the bulk of 
the side chain. 

A "deletion" refers to a change in the amino acid or nucleotide sequence that results in the 
absence of one or more amino acid residues or nucleotides. 

The term "derivative" refers to a chemically modified polynucleotide or polypeptide 
O^emacal modifications of a polynucleotide can include, for example, replacement of hydrogen by an 
alkyl, acyl, hydroxyl, or amino group. A derivative polynucleotide encodes a polypeptide which 
retams at least one biological or immunological function of the natural molecule. A derivative 
polypeptide is one modified by glycosylation, pegylation, or any similar process that retains at least 
one biological or immunological function of the polypeptide from ^vhich it was derived 

A "detectable label" refers to a reporter molecule or enzyme that is capable of generating a 
35 measurable signal and is covalentiy or noncovalendy joined to a polynucleotide or polypeptide 

"Drfferential expression" refers to mcreased or upregulated; or decreased, downregulated or 
absent gene or protein expression, determined by comparing at least two different samples Such ' 
comparisons may be carried out between, for example, a treated and an untreated sample, or a 
diseased and a nonaal sample. 

"Exon shuffling" refers to the recombination of different coding regions (exons). Since an 
exon may represent a structural or functional domain of the encoded protein, new proteins may be 
assembled through the novel reassortment of stable substructures, thus allowing acceleration of the 
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evolution of new protein functions. 

A "fragment" is a unique portion of CGDD or a polynucleotide encoding CGDD which can 
be identical in sequence to. but shorter m length than, the parent sequence. A fragment may comprise 
up to the entire length of the defined sequence, manus one nucleotide/amino acid residue For 
example, a fragment may comprise from about 5 to about 1000 contiguous nucleotides or amino acid 
residues. A fragment used as a probe, primer, antigen, therapeutic molecule, or for other purposes 
may be at least 5. 10. 15. 16. 20. 25, 30, 40, 50, 60, 75, 100, 150, 250 or at least 500 contiguous ' 
nucleotides or amino acid residues in length. Fragments may be preferentiaUy selected from certain 
regions of a molecule. For example, a polypeptide fragment may comprise a certain length of 
contiguous amino acids selected from the first 250 or 500 ammo acids (or f.st 25% or 50%) of a 
polypeptide as shown in a certain defined sequence. Clearly these lengths are exemplary, and any 
length that is supported by the specification, includmg the Sequence Listing, tables, and figures may 
be encompassed by the present embodiments. 

A fragment of SEQ ID NO:34-66 can comprise a region of unique polynucleotide sequence 
that specifically identifies SEQ ID NO:34-66. for exaix^le, as distmct from any other sequence in the 
genome from which the fragment was obtained. A fragment of SEQ ID NO:34-66 can be employed in 
one or more embodiments of methods of the mvention. for example, in hybridization and 
amplification technologies and in analogous methods that distinguish SEQ ID NO:34-66 from related 
polynucleotides. The precise length of a fragment of SEQ ID NO:34-66 and die region of SEQ ID 
20 NO:34-66 to which the fragment corresponds are routinely detenninable by one of ordinary skill in 
the art based on the intended purpose for the fragment. 

A fiagmem of SEQ ID NO:l-33 is encoded by a fiagment of SEQ ID NO:34-66. A fragment 
of SEQ ID NO:l-33 can comprise a region of Clique amino acid sequence that specifically identifies 
SEQ ID NO:l-33. For example, a fragment of SEQ ID NO: 1-33 can be used as an immunogenic 
peptide for the development of antibodies that specificaUy recogni^ SEQ ID NO: 1-33. The precise 
length of a fragment of SEQ ID NO:1.33 and the region of SEQ ID NO:l-33 to which the fragment 
corresponds can be determined based on the intended purpose for the fragment using one or more 
analytical methods described herein or otherwise known in the art. 

A 'fuU length" polynucleotide is one containing at least a franslation mitiation codon (e g 
tnethiomne) foUowed by an open reading frame and a translation termination codon. A "Ml length" 
polynucleotide sequence encodes a "full length" polypeptide sequence. 

"Homology" refers to sequence similarity or. alternatively, sequence identity, between two or 
more polynucleotide sequences or two or more polypeptide sequences. 

The terms "pereent identity" and identity." as applied to polynucleotide sequences refer 
to the pereentage of identical residue matches between at least two polynucleotide sequences aligned 
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using a standardized algorithm. Such an algorithm may insert, in a standardized and reproducible 
way, gaps in the sequences being compared in order to optimize alignment between two sequences, 
and therefore achieve a more meaningful comparison of the two sequences. 

Percent identity between polynucleotide sequences may be determined using one or more 
5 computer algorithms or programs known in the art or described herein. For example, percent identity 
can be determined using the default parameters of the CLUSTAL V algorithm as incorporated into the 
MEGALIGN version 3. 12e sequence alignment program This program is part of the LASERGENE 
software package, a suite of molecular biological analysis programs (DNASTAR, Madison WI). 
CLUSTAL V is described in Higgins, D.G. and P.M. Sharp (1989; CABIOS 5:151-153) and in 
10 ffiggins, D.G. et al. (1992; CABIOS 8:189-191). For pairwise alignments of polynucleotide 
sequences, the default parameters are set as follows: Ktuple-2, gap penalty=5, window=4, and 
"diagonals saved"^. The "weighted" residue weight table is selected as the default. 

Alternatively, a suite of commonly used and freely available sequence comparison algorithms 
which can be used is provided by the National Center for Biotechnology Information (NCBI) Basic 
15 Local Alignment Search Tool (BLAST) (Altschul, S.F. et al. (1990) J. Mol. Biol. 215:403^10), 
which is available from several sources, including the NCBI, Bethesda, MD, and on the Internet at 
http://www.ncbi.nhn.nih.gov/BLAST/. The BLAST software suite includes various sequence 
analysis programs including "blastn," that is used to align a known polynucleotide sequence with 
other polynucleotide sequences from a variety of databases. Also available is a tool called "BLAST 2 
Sequences" that is used for direct pairwise comparison of two nucleotide sequences. "BLAST 2 
Sequences" can be accessed and used interactively at http://www.ncbi.nhn.nih.gov/gorf/bl2.html. 
The "BLAST 2 Sequences" tool can be used for both blastn and blastp (discussed below). BLAST 
programs are commonly used with gap and other parameters set to default settings. For example, to 
compare two nucleotide sequences, one may use blastn with the "BLAST 2 Sequences" tool Version 
2.0.12 (April-21-2000) set at default parameters. Such default parameters may be, for example: 
Matrix: BWSUM62 
Reward for match: 1 
Penalty for mismatch: -2 
Open Gap: 5 and Extension Gap: 2 penalties 
Gap X drop-off: 50 
Ejq>ect: 10 
Word Size: 11 
Filter: on 

Percent identity may be measured over the length of an entire defined sequence, for example, 
as defined by a particular SEQ ID number, or may be measured over a shorter length, for example. 
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over the length of a fragment taken from a larger, defined sequence, for instance, a fragment of at 
least 20, at least 30, at least 40, at least 50, at least 70, at least 100, or at least 200 contiguous 
nucleotides. Such lengths are exemplary only, and it is understood that any fragment length 
supported by the sequences shown herein, in the tables, figures, or Sequence Listing, may be used to 
describe a length over which percentage identity may be measured. 

Nucleic acid sequences that do not show a high degree of identity may nevertheless encode 
similar amino acid sequences due to the degeneracy of the genetic code. It is understood that changes 
in a nucleic acid sequence can be made using this degenetacy to produce multiple nucleic acid 
sequences that all encode substantially the same protein. 

The phrases "percent identity" and "% identity," as applied to polypeptide sequences, refer to 
the percentage of identical residue matches between at least two polypeptide sequences aligned using 
a standardized algoridmi. Methods of polypeptide sequence alignment are weU-known. Some 
alignment methods take into account conservative amino acid substitutions. Such conservative 
substitutions, explained in more detail above, generaUy preserve the charge and hydrophobicity at tiie 
site of substitution, tiius preserving the structure (and therefore fimction) of the polypeptide. The 
phrases "percent similarity" and "% similarity," as applied to polypeptide sequences, refer to the 
percentage of residue matches, including identical residue matches and conservative substitutions, 
between at least two polypeptide sequences aligned using a standardized algorithm. In contrast, 
conservative substitutions are not mcluded in tixe calculation of percent identity between polypeptide 
20 sequences. 

Percent identity between polypeptide sequences may be determined usmg die default 
parameters of the CLUSTAL V algorithm as incorporated into tiie MEGALIGN version 3.12e 
sequence aUgnment program (described and referenced above). For pairwise alignments of 
polypeptide sequences using CLUSTAL V. the defauh parameters are set as follows: Ktuple=l, gap 
25 penalty=3, window=5, and "diagonals saved"=5. The PAM250 matrix is selected as die default 
residue weight table. 

Alternatively the NCBI BLAST software suite may be used. For example, for a pairwise 
comparison of two polypeptide sequences, one may use the "BLAST 2 Sequences" tool Version 
2.0.12 (ApriI-21-2000) with blastp set at default parameters. Such default parameters may be, for 
30 exanq)le: 

Matrix: BLOSUM62 

Open Gap: 11 and Extension Gap: 1 penalties 
Gap X drop-off: 50 
Expect: 10 
35 Word Size: 3 
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Filter: on 

Percent identity may be measured over the length of an entire defined polypeptide sequence, 
for example, as defined by a particular SEQ ID number, or may be measured over a shorter length, for 
example, over the length of a fragment taken from a larger, defined polypeptide sequence, for 
instance, a fragment of at least 15, at least 20. at least 30, at least 40, at least 50, at least 70 or at least 
150 contiguous residues. Such lengths are exemplaiy only, and it is understood that any fragment 
length supported by the sequences shown herein, in the tables, figures or Sequence Listing, may be 
used to describe a length over which percentage identity may be measured. 

"Human artificial chromosomes" (HACs) are Imear microchromosomes which may contain 
DNA sequences of about 6 kb to 10 Mb in size and which contain all of the elements required for 
chromosome replication, segregation and maintenance. 

The term "humanized antibody" refers to an antibody molecule in which the amino acid 
sequence in the non-antigen binding regions has been altered so that die antibody more closely 
resembles a human antibody, and still retains its original binding ability. 

"Hybridization" refers to the process by which a polynucleotide strand anneals with a 
complementary strand through base pauing under defined hybridization conditions. Specific 
hybridization is an indication that two nucleic acid sequences share a high degree of complementarity. 
Specific hybridization complexes form under permissive annealing conditions and remain hybridized 
after the "washing" step(s). The washing step(s) is particularly important in detennming the 
stringency of the hybridization process, with more stringent conditions aUowing less non-specific 
binding, i.e.. binding between pairs of nucleic acid strands that are not perfectly matched. Permissive 
conditions for amiealing of nucleic acid sequences are routinely determinable by one of ordinary skill 
in the art and may be consistent among hybridization experiments, whereas wash conditions may be 
varied among experiments to achieve the desired stringency, and therefore hybridization specificity. 
Permissive annealing conditions occur, for example, at 68°C in the presence of about 6 x SSC. about 
1% (w/v) SDS. and about 100 /tg/ml sheared, denatured salmon sperm DNA. 

CSenerally, stringency of hybridization is expressed, m part, with reference to the temperature 
under which the wash step is carried out. Such wash temperatures are typically selected to be about 
5°C to 20-0 lower than the thermal melting point (T J for the specific sequence at a defined ionic 
strength and pH. The T„ is the temperature (under defined ionic strength and pH) at which 50% of 
the target sequence hybridizes to a perfectly matched probe. An equation for calculating T„ and 
conditions for nucleic acid hybridization are well known and can be found in Sambrook, J. and D.W. 
Russell (2001; Molecular Cloning: A Labnratnry M.n.»i 3rd ed., vol. 1-3, Cold Spring Harbor Press, 
Cold Spring Harbor NY, ch. 9). 

High stringency conditions for hybridization between polynucleotides of the present 
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invention include wash conditions of 68°C in the presence of about 0.2 x SSC and about 0.1% SDS. 
for 1 hour. Alternatively, temperatures of about 65°C, 60°C, 55°C, or 42°C may be used. SSC 
concentration may be varied from about 0.1 to 2 x SSC, with SDS being present at about 0.1%. 
TypicaUy, blocking reagents are used to block non-specific hybridization. Such blocking reagents 
5 include, for instance, sheared and denatured salmon sperm DNA at about 100-200 /tg/ml. Organic 
solvent, such as formamide at a concentration of about 35-50% v/v, may also be used under particular 
circumstances, such as for RNA:DNA hybndizatioos. Useful variations on these wash conditions wiU 
be readily apparent to those of ordinary skill in die art. Hybridization, particularly under high 
stringency conditions, may be suggestive of evolutionary similarity between the nucleotides. Such 

10 similarity is strongly indicative of a sunilar role for the nucleotides and their encoded polypeptides. 

The term "hybridization complex" refers to a complex formed between two nucleic acids by 
virtue of the formation of hydrogen bonds between complementary bases. A hybridization complex 
may be formed in solution (e.g., Qt or R,t analysis) or formed between one nucleic acid present in 
solution and another nucleic acid immobDized on a solid support (e.g., paper, membranes, filters, 

15 chips, pins or glass slides, or any other appropriate substrate to which cells or their nucleic acids have 
been fixed). 

The words "insertion" and "addition" refer to chaiiges in an amino acid or polynucleotide 
sequence resulting in the addition of one or more amino acid residues or nucleotides, respectively, 
"hnmrnie response" can refer to conditions associated with inflammation, trauma, immune 
20 disorders, or infectious or genetic disease, etc. These conditions can be characterized by expression 
of various factors, e.g., cytokines, chemokines, and other signaling molecules, which may affect 
cellular and systemic defense systems. 

An "immunogenic fi-agment" is a polypeptide or oligopeptide fragment of CGDD which is 
capable of eUciting an immune response when introduced imo a living organism, for example, a 
mammal. The term "immunogenic fiagment" also includes any polypeptide or oligopeptide Segment 
OfCGDD which is useful m any of the antibody production methods disclosed herein or known in the 
art. 

The term "microarray" refers to an arrangement of a plurality of polynucleotides, 
polypeptides, antibodies, or other chemical compounds on a substrate. 
30 The terms "element" and "array element" refer to a polynucleotide, polypeptide, antibody, or 

other chemical compound having a unique and defined position on a microarray. 

The term "modulate" refers to a change in the activity of CGDD. For example, modulation 
may cause an increase or a decrease in protein activity, binding characteristics, or any other 
biological, functional, or immunological properties ofCGDD. 
35 The phrases "nucleic acid" and "nucleic acid sequence" refer to a nucleotide, oligonucleodde, 
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polynucleotide, or any fragment thereof. These phrases also refer to DNA or RNA of genomic or 
synthetic origin which may be single-stranded or double-stranded and may represent the sense or the 
antisense strand, to peptide nucleic acid (PNA), or to any DNA-Uke or RNA-like material. 

"Operably linked" refers to the situation in which a first nucleic acid sequence is placed in a 
functional relationship with a second nucleic acid sequence. For instance, a promoter is operably 
linked to a coding sequence if the promoter affects the transcription or expression of the coding 
sequence. Operably linked DNA sequences may be in close proximity or contiguous and, where 
necessary to join two protein coding regions, in the same reading frame. 

"Peptide nucleic acid" (PNA) refers to an antisense molecule or anti-gene agent which 
comprises an oligonucleotide of at least about 5 nucleotides in length linked to a peptide backbone of 
amino acid residues ending in lysine. The terminal lysine confers solubUity to the composition. 
PNAs preferentially bind complementary smgle stranded DNA or RNA and stop transcript 
elongation, and may be pegylated to extend their lifespan in the cell. 

"Post-translational modification" of an CGDD may involve lipidation, glycosylation, 
phosphorylation, acetylation, racemization, proteolytic cleavage, and other modifications known in 
the art. These processes may occur synthetically or biochemically. Biochemical modifications will 
vary by cell type depending on the enzymatic milieu of CGDD. 

"Probe" refers to nucleic acids encoding CGDD, their complements, or fragments thereof, 
which are used to detect identical, allelic or related nucleic acids. Probes are isolated 
oligonucleotides or polynucleotides attached to a detectable label or reporter molecule. Typical labels 
include radioactive isotopes, ligands, chemiluminescent agents, and enzymes. "Primers" are short 
nucleic acids, usually DNA oligonucleotides, which may be annealed to a target polynucleotide by 
complementary base-pairing. The primer may then be extended along the target DNA strand by a 
DNA polymerase enzyme. Primer pairs can be used for anq)Iification (and identification) of a nucleic 
acid, e.g., by the polymerase chain reaction (PGR). 

Probes and primers as used in the present invention typically comprise at least 15 contiguous 
nucleotides of a known sequence. In order to enhance specificity, longer probes and primers may also 
be employed, such as probes and primers that comprise at least 20, 25, 30, 40, 50, 60, 70, 80, 90, 100, 
or at least 150 consecutive nucleotides of the disclosed nucleic acid sequences. Probes and primers 
may be considerably longer than these examples, and it is understood that any length supported by the 
specification, including the tables, figures, and Sequence Listmg, may be used. 

Methods for preparing and using probes and primers are described in, for example, 
Sambrook, J. and D.W. RusseU (2001; Molecular Cloning: A Laboratory Manual . 3rd ed., vol. 1-3, 
Cold Spring Harbor Press, Cold Spring Harbor NY), Ausubel, RM. et al. (1999; Short Protocols in 
Molecular Biology. 4* ed., John Wiley & Sons, New York NY), and Innis, M. et al. (1990; PGR 
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Protocols, A Guide to Methods and Ap plirations, Academic Press, San Diego CA). PGR primer pairs 
can be derived from a known sequence, for example, by using computer programs intended for that 
purpose such as Primer (Version 0.5. 1991, WMtehead Institute for Biomedical Research, Cambridge 
MA). 

5 Oligonucleotides for use as primers are selected using software known in the art for such 

purpose. For example, OLIGO 4.06 software is useful for the selection of PGR primer pairs of up to 
100 nucleotides each, and for die analysis of oligonucleotides and larger polynucleotides of up to 
5,000 nucleotides from an input polynucleotide sequence of up to 32 kilobases. Similar primer 
selection programs have incorporated additional features for expanded capabilities. For example, the 
10 PrimOU primer selection program (available to the pubUc from the Genome Center at University of 
Texas South West Medical Center, DaUas TX) is capable of choosing specific primers from megabase 
sequences and is thus useful for designing primers on a genome-wide scope. The Primer3 primer 
selection program (available to the public from the Whitehead Institute/Mrr Center for Genome 
Research, Cambridge MA) aUows the user to input a "mispriming library," in which sequences to 
15 avoid as primer binding sites are user-specified. PrimerB is usefiil. in particular, for the selection of 
oligonucleotides for microarrays. (The source code for the latter two primer selection programs may 
also be obtained from then: respective sources and modified to meet die user's specific needs.) The 
PrimeGen program (available to the public from the UK Human Genome Mapping Project Resource 
Centre, Cambridge UK) designs primers based on multiple sequence alignments, thereby allowing 
20 selection of prhners that hybridize to either the most conserved or least conserved regions of aligned 
nucleic acid sequences. Hence, this program is useful for identification of both unique and conserved 
oligonucleotides and polynucleotide fragments. The oUgonucleotides and polynucleotide fragments 
identified by any of the above selection methods are usefiil in hybridization technologies, for 
example, as PGR or sequencing primers, microarray elements, or specific probes to identify fiilly or 
25 partially complementary polynucleotides in a sanq)le of nucleic acids. Methods of oligonucleotide 
selection are not limited to those described above. 

A "recombinant nucleic acid" is a nucleic acid that is not naturaUy occurring or has a 
sequence that is made by an artificial combination of two or more otherwise separated segments of 
sequence. This artificial combination is often accomplished by chemical synthesis or. more 
commonly, by the artificial manipulation of isolated segments of nucleic acids, e.g., by genetic 
engineering techniques such as those described in Sambrtiok and Russell (supra). The term 
recombmant includes nucleic acids that have been altered solely by addition, substitution, or deletion 
of a portion of the nucleic acid. Frequently, a recombinant nucleic acid may include a nucleic acid 
sequence operably linked to a promoter sequence. Such a recombinant nucleic acid may be part of a 
vector that is used, for exanqile, to transform a cell. 
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Alternatively, such recombinant nucleic acids may be part of a viral vector, e.g., based on a 
vaccinia virus, that could be use to vaccinate a mammal wherein the recoinbinant nucleic acid is 
expressed, inducing a protective immunological response in the mammal. 

A "regulatory element" refers to a nucleic acid sequence usually derived from untranslated 
5 regions of a gene and includes enhancers, promoters, introns, and 5' and 3' untranslated regions 
(UTRs). Regulatory elements interact with host or viral proteins which control transcription, 
translation, or RNA stability. 

"Reporter molecules" are chemical or biochemical moieties used for labeling a nucleic acid, 
amino acid, or antibody. Reporter molecules include radionuclides; enzymes; fluorescent, 
10 chemiluminescent, or chromogenic agents; substrates; cofactors; inhibitors; magnetic particles; and 
other moieties known in the art. 

An "RNA equivalent," in reference to a DNA molecule, is composed of the same linear 
sequence of nucleotides as the reference DNA molecule with the exception that all occurrences of the 
nitrogenous base thymine are replaced with uracil, and the sugar backbone is composed of ribose 
15 instead of deoxyribose. 

The term "sample" is used in its broadest sense. A sample suspected of containing CGDD, 
nucleic acids encoding CGDD, or fragments thereof may comprise a bodily fluid; an extract from a 
cell, chromosome, organeUe, or membrane isolated from a cell; a ceU; genomic DNA, RNA, or 
cDNA, in solution or bound to a substrate; a tissue; a tissue print; etc. 
20 The terms "specific binding" and "specifically binding" refer to that interaction between a 

protein or peptide and an agonist, an antibody, an antagonist, a small molecule, or any natural or 
synthetic binding composition. The interaction is dependent upon the presence of a particular 
structure of the protein, e.g., the antigenic determinant or epitope, recognized by the binding 
molecule. For example, if an antibody is specific for epitope "A," the presence of a polypeptide 
25 comprising the epitope A, or the presence of free unlabeled A, in a reaction containing free labeled A 
and the antibody mil reduce the amount of labeled A that binds to the antibody. 

The term "substantially purified" refers to nucleic acid or amino acid sequences that are 
removed from their natural environment and are isolated or separated, and are at least about 60% free, 
preferably at least about 75% free, and most preferably at least about 90% free from other 
30 components with which they are naturally associated. 

A "substitution" refers to the replacement of one or more amino acid residues or nucleotides 
by different ammo acid residues or nucleotides, respectively. 

"Substrate" refers to any suitable rigid or semi-rigid support including membranes, filters, 
chips, slides, wafers, fibers, magnetic or nonmagnetic beads, gels, tubing, plates, polymers, 
35 microparticles and capillaries. The substrate can have a variety of surface forms, such as weUs, 
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trenches, pins, channels and pores, to which polynucleotides or polypeptides are bound. 

A "transcript image" or "expression profile" refers to the collective pattern of gene 
expression by a particular cell type or tissue under given conditions at a given time. 

'Transformation" describes a process by which exogenous DNA is introduced into a recipient 
5 cell. Transfomiation may occur under natural or artificial conditions according to various methods 
well known in the art, and may rely on any known method for the insertion of foreign nucleic acid 
sequences into a prokaryotic or eukaryotic host cell. The method for transformation is selected based 
on the type of host cell being transformed and may include, but is not limited to, bacteriophage or 
viral infection, electroporation, heat shock, lipofection, and particle bombardment. The term 

10 "transformed cells" includes stably transfom^ cells in which the inserted DNA is capable of 

replication either as an autonomously replicating plasmid or as part of the host chromosome, as well 
as transiently transformed cells which express the inserted DNA or RNA for limited periods of time. 

A "transgenic organism," as used herein, is any organism, including but not limited to 
anunals and plants, in which one or more of the cells of the organism contains heterologous nucleic 

15 acid mtroduced by way of human intervention, such as by transgenic techniques well known in the 
art. The nucleic acid is introduced into the cell, directly or indirecdy by introduction into a precursor 
of the cell, by way of deliberate genetic manipulation, such as by microinjection or by infection with 
a recombinant virus. In another enobodiment, the nucleic acid can be introduced by infection with a 
recombinant viral vector, such as a lentiviral vector (Lois, C. et al. (2002) Science 295:868-872). The 

20 term genetic manipulation does not include classical cross-breeding, or in vitro fertilization, but 
rather is directed to the introduction of a recombinant DNA molecule. The transgenic organisms 
contemplated in accordance with the present invention include bacteria, cyanobacteria, fungi, plants 
and animals. The isolated DNA of the present mvention can be mtroduced into the host by methods 
known in the art, for example infection, transfection, transformation or transconjugation. Techniques 

25 for transferring tlie DNA of the present invention into such organisms are widely known and provided 
in references such as Sambrook and Russell (supra). 

A "variant" of a particular nucleic acid sequence is defined as a nucleic acid sequence having 
at least 40% sequence identity to the particular nucleic acid sequence over a certain length of one of 
the nucleic acid sequences using blastn with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07- 

30 1999) set at default parameters. Such a pair of nucleic acids may show, for example, at least 50%, at 
least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 
93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% or greater - 
sequence identity over a certam defined length. A variant may be described as, for example, an 
"allelic" (as defined above), "splice," "species," or "polymorphic" variant A splice variant may have 

35 significant identity to a reference molecule, but will generally have a greater or lesser number of 
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polynucleotides due to alternate splicing of exons during raRNA processing. The coiresponding 
polypeptide may possess additional fiinctional domains or lack domains that are present in the 
reference molecule. Species variants are polynucleotides that vary from one species to another. The 
resulting polypeptides will generally have significant amino acid identity relative to each other. A 
polymorphic variant is a variation in the polynucleotide sequence of a particular gene between 
individuals of a given species. Polymorphic variants also may encompass "single nucleotide 
polymorphisms" (SNPs) in which the polynucleotide sequence varies by one nucleotide base. The 
presence of SNPs may be indicative of, for exanq)Ie, a certain population, a disease state, or a 
propensity for a disease state. 

A "variant" of a particular polypeptide sequence is defined as a polypeptide sequence having 
at least 40% sequence identity or sequence similarity to the particular polypeptide sequence over a 
certain length of one of the polypeptide sequences using blastp with the "BLAST 2 Sequences" tool 
Version 2.0.9 (May-07-1999) set at default parameters. Such a pair of polypeptides may show, for 
example, at least 50%, at least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 
91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, 
or at least 99% or greater sequence identity or sequence similarity over a certain defmed length of one 
of the polypeptides. 
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THE INVENTION 

Various embodiments of the invention include new human proteins associated with cell 
growth, differentiation, and death (CGDD), the polynucleotides encoding CGDD, and the use of these 
compositions for the diagnosis, treatment, or prevention of cell proliferative disorders mcluding 
cancer, developmental disorders, neurological disorders, autoimmune/inflammatory disorders, 
reproductive disorders, and disorders of the placenta. 

Table 1 summarizes the nomenclature for the full length polynucleotide and polypeptide 
embodunents of the invention. Each polynucleotide and its corresponding polypeptide are correlated 
to a single Incyte project identification number (Licyte Project DD). Each polypeptide sequence is 
denoted by both a polypeptide sequence identification number (Polypeptide SEQ ID NO:) aud an 
Incyte polypeptide sequence number (hicyte Polypeptide ID) as shown. Each polynucleotide 
30 sequence is denoted by both a polynucleotide sequence identification number (Polynucleotide SEQ 
ID NO:) and an Incyte polynucleotide consensus sequence number (hicyte Polynucleotide ID) as 
shown. 

Table 2 shows sequences with homology to polypeptide embodiments of the invention as 
identified by BLAST analysis against the GenBank protem (genpept) database and the PROTEOME 
35 database. Columns 1 and 2 show the polypeptide sequence identification number (Polypeptide SEQ 
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ID NO:) and the corresponding Incyte polypeptide sequence number (Incyte Polypeptide ID) for 
polypeptides of the invention. Column 3 shows the GenBank identification number (GenBank ED 
NO:) of the nearest GenBank homolog and the PROTEOME database identification numbers 
(PROTEOME ID NO:) of the nearest PROTEOME database homologs. Column 4 shows the 
probability scores for the matches between each polypeptide and its homolog(s). Column 5 shows the 
annotation of the GenBank and PROTEOME database homolog(s) along with relevant citations where 
applicable, all of which are expressly incorporated by reference herein. 

Table 3 shows various structural features of the polypeptides of the invention. Columns 1 
and 2 show die polypeptide sequence identification number (SEQ ID NO:) and the corresponding 
Incyte polypeptide sequence number (Incyte Polypeptide ID) for each polypeptide of the invention. 
Column 3 shows the number of amino acid residues in each polypeptide. Column 4 shows potential 
phosphorylation sites, and column 5 shows potential glycosylation sites, as determined by the 
MOTIFS program of the GCG sequence analysis software package (Accelrys, Burlington MA). 
Column 6 shows amino acid residues comprising signature sequences, domains, and motifs. Column 
7 shows analytical methods for protein structure/function analysis and in some cases, searchable 
databases to which the analytical methods were applied. 

Together, Tables 2 and 3 summarize the properties of polypeptides of the invention, and these 
properties establish that the claimed polypeptides are proteins associated with cell growth, 
differentiation, and death. For example, SEQ ID NO: 13 is 97% identical, from residue S220 to 
residue A552 and 95% identical, from residue Ml to residue 1240, to human DOC-2 (GenBank ID 
gl297330) as determined by the Basic Local AUgnment Search Tool (BLAST). (See Table 2.) The 
BLAST probability score is 2.5e-297, which indicates the probabiUty of obtaining the observed 
polypeptide sequence alignment by chance. SEQ ID NO: 13 also has homology to protems that are 
mvolved in ceU proliferation and adhesion, and are DOC-2, as determined by BLAST analysis usmg 
the PROTEOME database. SEQ ID NO: 13 also contams a phosphotyrosme interaction (PTB/PID) 
domam as determined by searching for statistically significant matches in the hidden Markov model 
(HMM)-based PFAM database of conserved protein family domains. (See Table 3.) Data from other 
BLAST analyses provide further corroborative evidence that SEQ ID NO: 13 is a IX)C-2. 

In another example, SEQ ID NO: 19 is 100% identical, from residue Ml to residue C191, to 
human D53 (GenBank ID gl469920) as detennined by BLAST. The BLAST probability score is 
4.6e-92. SEQ ID NO: 19 also has homology to proteins that play a role in cell proliferation, and are 
tumor protem D52-like 1 proteins, as determined by BLAST analysis using the PROTEOME 
database. Data from other BLAST analyses provide fiirther corroborative evidence that SEQ ID 
NO: 19 is a tumor protem D52 protein. 

In yet another example, SEQ ID NO:26 is 93% identical, from residue 146 to residue 1109, 
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and 100% identical, from residue Ml to residue E49, to murine mm-Mago (GenBank JD g2330011) 
as determined by BLAST. The BLAST probability score is 4.6e-54. SEQ ID NO:26 also has 
homology to proteins that have developmental ftmction, and are Mago-nashi proteins, as determined 
by BLAST analysis using the PROTEOME database. SEQ ID NO:26 also contains a Mago-nashi 
5 domain as determined by searching for statistically significant matches in the hidden Markov model 
(HMM)-based PFAM database. (See Table 3.) Data from additional BLAST analyses provide 
further corroborative evidence that SEQ E) NO:26 is a Mago-nashi protein. 

In a further example, SEQ E) NO:32 is 51% identical, ftom residue Ell to residue Y682, to 
human cytoplasmic protem, Roppl20 (GenBank ID gl4583268) as determined by BLAST. The 
10 BLAST probability score is 3.6e- 1 88. SEQ ID NO:32 also has homology to proteins that are 
localized to the cytoskeleton, and are similar to WD-repeat microtubule-associated proteins, as 
determined by BLAST analysis using the PROTEOME database. SEQ ID NO:32 also contains seven 
G-beta repeat WD domains as determined by searchmg for statistically significant matches in the 
hidden Markov model (HMM)-based PFAM database. (See Table 3.) Data from BLIMPS and other 
15 BLAST analyses provide further corroborative evidence that SEQ ID NO:32 is a human WD-repeat 
microtubule-associated protein. SEQ ID NO: 1-12, SEQ ID NO: 14-18,SEQ ID NO:20-25,SEQ ID 
NO:27-31, and SEQ ID NO:33 were analyzed and annotated in a similar manner. The algorithms and 
parameters for the analysis of SEQ ID NO: 1-33 are described in Table 7. 

As shown in Table 4, the full length polynucleotide embodiments were assembled using 
20 cDNA sequences or coding (exon) sequences derived from genomic DNA, or any combination of 
these two types of sequences. Column 1 lists the polynucleotide sequence identification number 
(Polynucleotide SEQ ID NO:), the corresponding Incyte polynucleotide consensus sequence number 
(hicyte ID) for each polynucleotide of the invention, and the length of each polynucleotide sequence 
in basepairs. Column 2 shows the nucleotide start (5') and stop (3') positions of the cDNA and/or 
25 genomic sequences used to assemble the full length polynucleotide embodiments, and of fragments of 
the polynucleotides which are useful, for example, in hybridization or amplification technologies that 
identify SEQ ID NO:34-66 or that distinguish between SEQ ID NO:34-66 and related 
polynucleotides. 

The polynucleotide fragments described m Column 2 of Table 4 may refer specifically, for 
30 example, to Mcyte cDNAs derived from tissue-specific cDNA libraries or from pooled cDNA 

libraries. Alternatively, the polynucleotide fragments described m column 2 may refer to GenBank 
cDNAs or ESTs which contributed to the assembly of the full length polynucleotides. In addition, the 
polynucleotide fragments described in column 2 may identify sequences derived from the ENSEMBL 
(The Sanger Centre, Cambridge, UK) database (i.e., those sequences mcluding the designation 
35 "ENST"). Alternatively, the polynucleotide fragments described in column 2 may be derived from 
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the NCBI RefSeq Nucleotide Sequence Records Database (i.e., those sequences including the 
designation "NM" or "NT") or the NCBI RefSeq Protein Sequence Records (i.e., those sequences 
including the designation "NP"). Alternatively, the polynucleotide fragments described in column 2 
may refer to assemblages of both cDNA and Genscan-piedicted exons brought together by an "exon 
stitching" algorithm. For example, a polynucleotide sequence identified as 
FLJ(XXXXX_Ni_N2_YYYYY_Ns_N4 represents a "stitched" sequence in which XXXXXX is the 
identification number of the cluster of sequences to which the algorithm was applied, and YYYYY is 
the number of the prediction generated by the algorithm, and Nj^^.^...' if present, represent specific 
exons that may have been manually edited during analysis (See Exan^le V). Alternatively, the 
polynucleotide fragments in column 2 may refer to assemblages of exons brought together by an 
"exon-stretching" algorithm. For example, a polynucleotide sequence identified as 
FLXXXXXX_gAAAAA_gBBBBB_l_N is a "stretched" sequence, with XXXXXX being the Incyte 
project identificarion number, gA^4>U being the GenBank identification number of the human 
genomic sequence to which the "exon-stretching" algorithm was applied, gBBBBB being the 
GenBank identification number or NCBI RefSeq identification number of the nearest GenBank 
protein homolog, and referring to specific exons (See Example V). In instances where a RefSeq 
sequence was used as a protein homolog for the "exon-stretching" algorithm, a RefSeq identifier 
(denoted by "NM," "NP," or "NT") may be used in place of the GenBank identifier (i.e., gBBBBB). 

Alternatively, a prefix identifies component sequences that were hand-edited, predicted from 
genomic DNA sequences, or derived from a combination of sequence analysis methods. The 
following Table lists examples of component sequence prefixes and corresponding sequence analysis 
methods associated with the prefixes (see Example IV and Example V). 



Prefix 


Type of analysis and/or examples of programs 


GNN, GFG, 
ENST 


Exon prediction from genomic sequences using, for exanq>le, 
GENSCAN (Stanford University, CA, USA) or FGENES 
(Conq)uter Genomics Group, The Sanger Centre, Cambridge, UK). 


GBI 


Hand-edited analysis of genomic sequences. 


FL 


Stitched or stretched genomic sequences (see Exan^le V). 


INCY 


Full length transcript and exon prediction from mapping of EST 
sequences to the genome. Genomic location and EST composition 
data are combined to predict the exons and resulting transcript. 



hi some cases, Incyte cDNA coverage redundant with the sequence coverage shown in Table 
4 was obtained to confirm the final consensus polynucleotide sequence, but the relevant iicyte cDNA 
identification numbers are not shown. 
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Table 5 shows the representative cDNA libraries for those Ml length polynucleotides which 
were assembled using Incyte cDNA sequences. The representative cDNA Ubrary is the Incyte cDNA 
library which is most frequently represented by the Licyte cDNA sequences which were used to 
assemble and confirm the above polynucleotides. The tissues and vectors which were used to 
construct the cDNA libraries shown in Table 5 are described in Table 6. 

Table 8 shows single nucleotide polymorphisms (SNPs) found in polynucleotide sequences of 
the mvention, along with allele frequencies in different human populations. Colunms 1 and 2 show 
the polynucleotide sequence identification number (SEQ ID NO:) and the corresponding Incyte 
project identification number (PDD) for polynucleotides of the mvention. Colunrn 3 shows the Incyte 
identification number for the EST in which the SNP was detected (EST ID), and column 4 shows the 
identification number for the SNP (SNP ID). Column 5 shows the position within the EST sequence 
at which the SNP is located (EST SNP), and column 6 shows the position of the SNP within the fiill- 
length polynucleotide sequence (CBl SNP). Column 7 shows the allele found in the EST sequence. 
Columns 8 and 9 show the two aUeles found at the SNP site. Column 10 shows the amino acid 
encoded by the codon including the SNP site, based upon the aUele found in the EST. Columns 11-14 
show the frequency of aUele 1 m four different human populations. An entry of n/d (not detected) 
indicates that the frequency of aUele 1 in the population was too low to be detected, while n/a (not 
available) indicates that the allele frequency was not determined for the population . 

The invention also encompasses CGDD variants. Various embodunents of CGDD variants 
can have at least about 80%, at least about 90%, or at least about 95% amino acid sequence identity to 
the CGDD ammo acid sequence, and can contain at least one functional or structural characteristic of 
CGDD. 

Various embodiments also encompass polynucleotides which encode CGDD. In a particular 
embodunent, the invention encompasses a polynucleotide sequence comprising a sequence selected 
from the group consisting of SEQ ED NO:34-66, which encodes CGDD. The polynucleotide 
sequences of SEQ ID NO:34-66, as presented in the Sequence Listing, embrace the equivalent RNA 
sequences, wherem occurrences of the nitrogenous base thymine are replaced with uracil, and the 
sugar backbone is composed of ribose instead of deoxyribose. 

The invention also encompasses variants of a polynucleotide encoding CGDD. hi particular, 
such a variant polynucleotide wiU have at least about 70%, or alternatively at least about 85%, or 
even at least about 95% polynucleotide sequence identity to a polynucleotide encoding CGDD. A 
particular aspect of the invention encompasses a variant of a polynucleotide comprising a sequence 
selected from the group consisting of SEQ ID NO:34-66 which has at least about 70%, or 
alternatively at least about 85%, or even at least about 95% polynucleotide sequence identity to a 
nucleic acid sequence selected from the group consisting of SEQ ID NO:34-66. Any one of the 
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polynucleotide variants described above can encode a polypeptide which contains at least one 
fiinctional or structural characteristic of CGDD. 

hi addition, or in the alternative, a polynucleotide variant of the mvention is a splice variant 
of a polynucleoride encoding CGDD. A spUce variant may have portions which have significant 
5 sequence identity to a polynucleotide encoding CGDD, but will generally have a greater or lesser 
number of polynucleotides due to additions or deletions of blocks of sequence arising from alternate 
splicing of exons during mRNA processing. A splice variant may have less than about 70%, or 
alternatively less than about 60%, or alternatively less than about 50% polynucleotide sequence 
identity to a polynucleotide encoding CGDD over its entire length; however, portions of the splice 

10 variant will have at least about 70%, or alternatively at least about 85%, or alternatively at least about 
95%. or alternatively 100% polynucleotide sequence identity to portions of the polynucleotide 
encoding CGDD. For example, a polynucleotide comprising a sequence of SEQ ID NO:42 is a splice 
variant of a polynucleotide comprising a sequence of SEQ ID NO:43; a polynucleotide comprising a 
sequence of SEQ ID NO:45 is a splice variant of a polynucleotide comprising a sequence of SEQ ID 

15 NO:55; and a polynucleotide comprising a sequence of SEQ ID NO:56 is a splice variant of a 

polynucleotide comprismg a sequence of SEQ E) NO:57. Any one of the splice variants described 
above can encode a polypeptide which contains at least one functional or structural characteristic of 
CGDD. 

It will be appreciated by those skilled in the art that as a result of the degeneracy of the 
20 genetic code, a multitude of polynucleotide sequences encoding CGDD, some bearing minimal 
sinularity to the polynucleotide sequences of any known and naturally occurring gene, may be 
Foduced. Thus, the mvention contemplates each and every possible variation of polynucleotide 
sequence that could be made by selectmg combinations based on possible codon choices. These 
combinations are made in accordance with the standard triplet genetic code as applied to the 
25 polynucleotide sequence of naturaUy occurring CGDD, and all such variations are to be considered as 
being specifically disclosed. 

Although polynucleotides which encode CGDD and its variants are generally capable of 
hybridizmg to polynucleotides encoding naturally occurring CGDD under appropriately selected 
conditions of stringency, it may be advantageous to produce polynucleotides encoding CGDD or its 

30 derivatives possessing a substantially different codon usage. e.g., inclusion of non-naturally occurring 
codons. Codons may be selected to increase the rate at which expression of the peptide occurs in a 
particular prokaiyotic or eukaryotic host in accordance with the frequency with which particular 
codons are utilized by die host. Other reasons for substantially altering the nucleotide sequence 
encoding CGDD and its derivatives without altering the encoded ammo acid sequences include the 

35 production of RNA transcripts havmg more desurable properties, such as a greater half-Ufe, than 
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transcripts produced from the naturally occurring sequence. 

The invention also encompasses production of polynucleotides which encode CGDD and 
CGDD derivatives, or fragments thereof, entirely by synthetic chemistry. After production, the 
synthetic polynucleotide may be inserted into any of the many available expression vectors and cell 
systems using reagents well known in the art. Moreover, synthetic chemistry may be used to 
introduce mutations into a polynucleotide encoding CGDD or any fragment thereof. 

Embodiments of the invention can also include polynucleotides that are capable of 
hybridizing to the claimed polynucleotides, and, in particular, to those having the sequences shown in 
SEQ ID NO:34-66 and fragments thereof, under various conditions of stringency (Wahl, G.M. and 
S.L. Berger (1987) MeUiods Enzymol. 152:399-407; Kimmel, A.R. (1987) Methods Enzymol. 
152:507-511). Hybridization conditions, including annealing and wash conditions, are described in 
"Definitions," 

Methods for DNA sequencing are well known in the art and may be used to practice any of 
the embodiments of the invention. The methods may employ such enzymes as the Klenow fragment 
of DNA polymerase I, SEQUENASE (US Biochemical, Cleveland OH), Taq polymerase (Applied 
Biosystems), thermostable T7 polymerase (Amersham Biosciences, Piscataway NJ), or combinations 
of polymerases and proofreading exonucleases such as those found m the ELONGASE amplification 
system (Invitrogen, Carlsbad CA). Preferably, sequence preparation is automated with machines such 
as the MCROLAB 2200 liquid transfer system (Hamilton, Reno NV), PTC200 thermal cycler (MJ 
Research, Watertown MA) and ABI CATALYST 800 thermal cycler (Applied Biosystems). 
Sequencing is then carried out using either the ABI 373 or 377 DNA sequencing system (Applied 
Biosystems), the MEGABACE 1000 DNA sequencing system (Amersham Biosciences), or other 
systems known in the art. The resulting sequences are analyzed using a variety of algoridmis which 
are well known in the art (Ausubel et al., supra, ch. 7; Meyers, R.A. (1995) Molecular Biolnp v anH 
BlQ^^9hnology, WHey VCH, New York NY, pp. 856-853). 

The nucleic acids encoding CGDD may be extended utUizing a partial nucleotide sequence 
and employing various PCR-based methods known in the art to detect upstream sequences, such as 
promoters and regulatory elements. For ejcample, one method which may be employed, 
restriction-site PGR, uses universal and nested primers to amplify unknown sequence from genomic 
DNA within a cloning vector (Sarkar, G. (1993) PGR Methods AppUc. 2:318-322). Another method, 
inverse PGR, uses primers that extend in divergent directions to amplify unknown sequence from a 
circularized tenq)late. The template is derived from restriction fragments comprising a known 
genomic locus and surrounding sequences (Triglia, T. et al. (1988) Nucleic Acids Res. 16:8186). A 
third method, capture PGR, involves PGR ampUfication of DNA fragments adjacent to known 
sequences in human and yeast artificial chromosome DNA (Lagerstrom, M. et al. (1991) PGR 
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Methods Applic. 1:111-1 19). In this method, multiple restriction enzyme digestions and ligations 
may be used to insert an engineered double-stranded sequence into a region of unknown sequence 
before performing PGR. Other methods which may be used to retrieve unknown sequences are 
known in the art (Parker, J.D. et al. (1991) Nucleic Acids Res. 19:3055-3060). Additionally, one may 
5 use PGR, nested primers, and PROMOTERFINDER libraries (Glontech, Palo Alto CA) to walk 

genomic DNA. This procedure avoids the need to screen libraries and is useful in finding intron/exon 
junctions. For all PCR-based methods, primers may be designed using commercially available 
software, such as OLIGO 4.06 primer analysis software (National Biosciences, Plymouth MN) or 
another appropriate program, to be about 22 to 30 nucleotides in length, to have a GC content of 
10 about 50% or more, and to aimeal to the template at temperatures of about 68°C to 72°C. 

When screening for fiall length cDNAs, it is preferable to use libraries that have been 
size-selected to include larger cDNAs. In addition, random-primed libraries, which often include 
sequences containing the 5' regions of genes, are preferable for situations in which an ohgo d(T) 
library does not yield a full-length cDNA. Genomic libraries may be useful for extension of sequence 
15 into 5' non-transcribed regulatory regions. 

Capillary electrophoresis systems which are commercially available may be used to analyze 
the size or confmn the nucleotide sequence of sequencing or PGR products. In particular, capiUary 
sequencing may employ flowable polymers for electrophoretic separation, four different nucleotide- 
specific. laser-stimulated fluorescent dyes, and a charge coupled device camera for detection of the 
20 emitted wavelengths. Output/Ught intensity may be converted to electrical signal using appropriate 
software (e.g., GENOTYPER and SEQUENGE NAVIGATOR, Applied Biosystems), and the entire 
process fiiom loading of samples to computer analysis and electronic data display may be computer 
controlled. Gapillary electrophoresis is especially preferable for sequencing small DNA fragments 
which may be present in limited amounts in a particular san^le. 
25 In another embodiment of the invention, polynucleotides or fragments thereof which encode 

CGDD may be cloned in recombinant DNA molecules that direct expression of CGDD, or fi:agments 
or functional equivalents thereof, in appropriate host ceUs. Due to the mherent degeneracy of the 
genetic code, other polynucleotides which encode substantially the same or a functionally equivalent 
polypeptides may be produced and used to express CGDD. 

The polynucleotides of the invention can be engineered using methods generally known in the 
art in order to alter CGDD^ncoding sequences for a variety of purposes mcluding, but not limited to, 
modification of the cloning, processing, and/or expression of the gene product. DNA shufflmg by 
random fragmentation and PGR reassembly of gene fragments and synthetic oligonucleotides may be 
used to engineer the nucleotide sequences. For example, oUgonucleotide-mediated site-directed 
mutagenesis may be used to mtroduce mutations that create new restriction sites, alter glycosylation 



68 



wo 03/050253 



PCTAJS02/39133 



patterns, change codon preference, produce splice variants, and so forth. 

The nucleotides of the present invention may be subjected to DNA shuffling techniques such 
as MOLEOULARBREEDING (Maxygen Inc., Santa Clara CA; described in U.S. Patent No. 
5,837,458; Chang, C.-C. et al. (1999) Nat. Biotechnol. 17:793-797; Christians, F.C. etal. (1999) Nat. 
5 Biotechnol. 17:259-264; and Cranneri, A. et al. (1996) Nat. Biotechnol. 14:315-319) to alter or 

improve the biological properties of CGDD, such as its biological or enzymatic activity or its ability 
to bind to other molecules or compounds. DNA shuffling is a process by which a library of gene 
variants is produced using PCR-mediated recombmation of gene fragments. The library is then 
subjected to selection or screening procedures that identify those gene variants with the desired 

10 properties. These preferred variants may then be pooled and further subjected to recursive rounds of 
DNA shuffling and selection/screening. Thus, genetic diversity is created through "artificial" 
breeding and rapid molecular evolution. For example, fragments of a smgle gene containing random 
point mutations may be recombined, screened, and then reshuffled until the desired properties are 
optimized. Alternatively, fragments of a given gene may be recombined with fragments of 

15 homologous genes in the same gene family, either from the same or different species, thereby 

maximizing the genetic diversity of multiple naturally occuning genes in a directed and controllable 
manner. 

In another embodiment, polynucleotides encoding CGDD may be synthesized, in whole or in 
part, using one or more chemical methods well known m the art (Caruthers, M.H. et al. (1980) 

20 Nucleic Acids Symp. Ser. 7:215-223; Horn, T. et al. (1980) Nucleic Acids Symp. Ser. 7:225-232). 
Alternatively, CGDD itself or a fragment thereof may be synthesized using chemical methods known 
in the art. For example, peptide synthesis can be performed using various solution-phase or 
solid-phase techniques (Creighton, T. (1984) Proteins. Structures and Molecular Properties . WH 
Freeman, New York NY, pp. 55-60; Roberge, J.Y. et al. (1995) Science 269:202-204). Automated 

25 synthesis may be achieved using the ABI 43 1 A peptide synthesizer (Applied Biosystems). 

Additionally, the amino acid sequence of CGDD, or any part diereof, may be altered during direct 
synthesis and/or combined with sequences from other proteins, or any part thereof, to produce a 
variant polypeptide or a polypeptide having a sequence of a naturally occurring polypeptide. 
The peptide may be substantially purified by preparative high performance Uquid 

30 chromatography (Chiez, R.M. and F.Z. Regnier (1990) Methods Enzymol. 182:392-421). The 
composition of the synthetic peptides may be confirmed by amino acid analysis or by sequencing 
(Creighton, supra, pp. 28-53). 

In order to express a biologically active CGDD, the polynucleotides encoding CGDD or 
derivatives thereof may be inserted into an appropriate expression vector, i.e., a vector which contains 

35 the necessary elements for transcriptional and translational control of the inserted coding sequence in 
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a suitable host. These elements include regulatory sequences, such as enhancers, constitutive and 
inducible promoters, and 5' and 3' untranslated regions in the vector and in polynucleotides encoding 
CGDD. Such elements may vary in their strength and specificity. Specific initiation signals may also 
be used to achieve more efficient translation of polynucleotides encoding CGDD. Such signals 
5 mclude the ATG uiitiation codon and adjacent sequences, e.g. the Kozak sequence. In cases where a 
polynucleotide sequence encoding CGDD and its initiation codon and upstream regulatory sequences 
are inserted into the appropriate expression vector, no additional transcriptional or translational 
control signals may be needed. However, in cases where only coding sequence, or a fragment thereof, 
is mserted, exogenous translational control signals mcluding an in-frame ATG initiation codon should 

10 be provided by the vector. Exogenous translational elements and initiation codons may be of various 
origins, both natural and synthetic. The efficiency of expression may be enhanced by the inclusion of 
enhancers appropriate for the particular host ceil system used (Scharf, D. et al. (1994) Results Probl. 
CeU Differ. 20: 125-162). 

Methods which are well known to those skilled in the art may be used to construct expression 

15 vectors contaming polynucleotides encodmg CGDD and appropriate transcriptional and translational 
control elements. These methods include in vitro recombinant DNA techniques, synthetic techniques, 
and in vivo genetic recombination (Sambrook and Russell, supra, ch. 1-4, and 8; Ausubel et al., supra, 
ch. 1, 3, and 15). 

A variety of expression vector/host systems may be utilized to contain and express 

20 polynucleotides encoding CGDD. These include, but are not limited to, microorganisms such as 
bacteria transformed with recombinant bacteriophage, plasmid, or cosmid DNA expression vectors; 
yeast transformed with yeast expression vectors; insect cell systems infected with viral expression 
vectors (e.g., baculovirus); plant cell systems transformed with viral expression vectors (e.g., 
cauliflower mosaic virus, CaMV, or tobacco mosaic virus, TMV) or with bacterial expression vectors 

25 (e.g., Ti or pBR322 plasmids); or animal cell systems (Sambrook and Russell, supra; Ausubel et al., 
supra; VanHeeke, G. and S.M. Schuster (1989) J. Biol. Chem. 264:5503-5509; Engelhard, E.K. et al. 
(1994) Proc. Natl. Acad. Sci. USA 91:3224-3227; Sandig, V. et al. (1996) Hum. Gene Ther. 7:1937- 
1945; Takamatsu, N. (1987) EMBO J. 6:307-311; The McGraw Hill Yearbook of Science and 
Technology (1992) McGraw ffiU, New York NY, pp. 191-196; Logan, J. and T. Shenk (1984) Proc. 

30 Natl. Acad. Sci. USA 81:3655-3659; Hanington, J.J. et al. (1997) Nat. Genet. 15:345-355). 

Expression vectors derived from retroviruses, adenoviruses, or herpes or vaccinia vmises, or from 
various bacterial plasmids, may be used for deUvery of polynucleotides to die targeted organ, tissue, 
or cell population (Di Nicola, M. et al. (1998) Cancer Gen. Ther. 5:350-356; Yu, M. et al. (1993) 
Proc. Natl. Acad. Sci. USA 90:6340-6344; Buller, R.M. et al. (1985) Namre 317:813-815; McGregor, 

35 D.P. et al. (1994) Mol. Immunol. 31:219-226; Verma, I.M. and N. Somia (1997) Nature 389:239- 
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242). The invention is not limited by the host ceU employed. 

In bacterial systems, a number of cloning and expression vectors may be selected depending 
upon die use intended for polynucleotides encoding CGDD. For example, routine cloning, 
subcloning, and propagation of polynucleotides encoding CGDD can be achieved using a 
5 multifunctional E. coli vector such as PBLUESCRIPT (Stratagene, La Jolla CA) or PSPORTl 

plasmid (Invitrogen). Ligation of polynucleotides encoding CGDD into the vector's multiple cloning 
site disrupts the lacZ gene, allowing a colorimetric screening procedure for identification of 
transformed bacteria containing recombinant molecules. In addition, these vectors may be useful for 
in vitro transcription, dideoxy sequencing, single strand rescue with helper phage, and creation of 

10 nested deletions in the cloned sequence (Van Heeke, G. and S.M. Schuster (1989) J. Biol. Chem. 
264:5503-5509). When large quantities of CGDD are needed, e.g. for the production of antibodies, 
vectors which direct high level expression of CGDD may be used. For example, vectors containing 
the strong, inducible SP6 or T7 bacteriophage promoter may be used. 

Yeast expression systems may be used for production of CGDD. A number of vectors 

15 containing constitutive or inducible promoters, such as alpha factor, alcohol oxidase, and PGH 

promoters, may be used in die yeast Saccharomyces cerevisiae or Pichia pastoris. In addition, such 
vectors direct either the secretion or intracellular retention of expressed proteins and enable 
integration of foreign polynucleotide sequences into the host genome for stable propagation (Ausubel 
et al., supra; Bitter, G.A. et al. (1987) Methods Enzymol. 153:516-544; Scorer, CA. et al. (1994) 

20 Bio/Technology 12:181-184). 

Plant systems may also be used for expression of CGDD. Transcription of polynucleotides 
encoding CGDD may be driven by viral promoters, e.g., the 35S and 19S promoters of CaMV used 
alone or in combination with the omega leader sequence from TMV (Takamatsu, N. (1987) EMBO J. 
6:307-311). Alternatively, plant promoters such as the small subunit of RUBISCO or heat shock 

25 promoters may be used (Coruzzi, G. et al. (1984) EMBO J. 3: 1671-1680; Broglie, R. et ai. (1984) 

Science 224:838-843; Winter, J. et al. (1991) Results Probl. CeU Differ. 17:85-105). These constructs 
can be introduced into plant cells by direct DNA transformation or padiogen-mediated transfection 
(The McGra w Hill Yearbook of Science and Technolopv (1992) McGraw Hill, New York NY, pp. 
191-196). 

30 In mammalian cells, a number of viral-based expression systems may be utilized. In cases 

where an adenovirus is used as an expression vector, polynucleotides encoding CGDD may be ligated 
mto an adenovirus transcription/translation complex consisting of the late promoter and tripartite 
leader sequence. Insertion in a non-essential El or E3 region of the viral genome may be used to 
obtain infective virus which expresses CGDD in host cells (Logan. J. and T. Shenk (1984) Proc. Natl. 

35 Acad. Sci. USA 81:3655-3659). In addition, transcription enhancers, such as the Rous sarcoma virus 
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(RSV) enhancer, may be used to increase expression in mammalian host cells. SV40 or EB V-based 
vectors may also be used for high-level protein expression. 

Human artificial chromosomes (HACs) may also be employed to deliver larger fragments of 
DNA than can be contained in and expressed from a plasmid. HACs of about 6 kb to 10 Mb are 
constructed and delivered via conventional delivery methods (liposomes, polycationic amino 
polymers, or vesicles) for therapeutic purposes (Harrington, J.J. et al. (1997) Nat. Genet. 15:345-355). 

For long term production of recombinant proteins in mammalian systems, stable expression of 
CGDD in cell lines is preferred. For example, polynucleotides encoding CGDD can be transfomied 
into cell lines using expression vectors which may contain viral origins of replication and/or 
endogenous expression elements and a selectable marker gene on the same or on a separate vector. 
FoUowing the introduction of the vector, cells may be allowed to grow for about 1 to 2 days in 
enriched media before being switched to selective media. The purpose of the selectable marker is to 
confer resistance to a selective agent, and its presence allows growth and recovery of cells which 
successfully express the introduced sequences. Resistant clones of stably transformed cells may be 
propagated using tissue culture techniques appropriate to the cell type. 

Any number of selection systems may be used to recover transformed cell lines. These 
include, but are not limited to, the herpes simplex virus thymidine kinase and adenine 
phosphoribosyltransferase genes, for use in tk and apf cells, respectively (Wigler, M. et al. (1977) 
CeU 11:223-232; Lowy, I. et al. (1980) Cell 22:817-823). Also, antimetabolite, antibiotic, or 
herbicide resistance can be used as the basis for selection. For example, dhfr confers resistance to 
methotrexate; neo confers resistance to the aminoglycosides neomycin and G-418; and als and pat 
confer resistance to chlorsulfuron and phosphinotricm acetyltransferase, respectively (Wigler, M. et 
al. (1980) Proc. Natl. Acad. Sci. USA 77:3567-3570; Colbere-Garapin, F. et al. (1981) J. Mol. Biol. 
150:1-14). Additional selectable genes have been described, e.g., trpB and hisD, which alter cellular 
requirements for metabolites (Hartman, S.C and R.C Mulligan (1988) Proc. Natl. Acad. Sci. USA 
85:8047-8051). Visible markers, e.g., anthocyanins, green fluorescent proteins (GFP; Qontech), p- 
glucuronidase and its substrate p-glucuronide, or luciferase and its substrate luciferin may be used. 
These markers can be used not only to identify transformants, but also to quantify the amount of 
transient or stable protein expression attributable to a specific vector system (Rhodes, C.A. (1995) 
Methods Mol. Biol. 55:121-131). 

Although the presence/absence of marker gene expression suggests that the gene of interest is 
also present, the presence and expression of the gene may need to be confirmed. For example, if the 
sequence encoding CGDD is inserted within a marker gene sequence, transformed ceUs containing 
polynucleotides encoding CGDD can be identified by the absence of marker gene fimction. 
Alternatively, a marker gene can be placed in tandem with a sequence encoding CGDD under the 
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control of a single promoter. Expression of the marker gene in response to induction or selection 
usually iudicates expression of the tandem gene as well. 

In general, host cells that contain the polynucleotide encoding CGDD and that express CGDD 
may be identified by a variety of procedures known to those of skill in the art. These procedures 
include, but are not limited to, DNA-DNA or DNA-RNA hybridizations, PGR amplification, and 
protein bioassay or immunoassay techniques which mclude membrane, solution, or chip based 
technologies for the detection and/or quantification of nucleic acid or protein sequences. 

hnmunological methods for detecting and measuring the expression of CGDD using either 
specific polyclonal or monoclonal antibodies are known m the art. Examples of such techniques 
mclude enzyme-Unked immunosorbent assays (ELISAs), radioimmunoassays (RIAs), and 
fluorescence activated cell sorting (FACS). A two-site, monoclonal-based mmiunoassay utilizing 
monoclonal antibodies reactive to two non-mterfering epitopes on CGDD is prefen'ed, but a 
competitive bindmg assay may be employed. These and other assays are weU known m the art 
(Hampton, R. et al. (1990) Serological Methods, a Lahoratorv Manual . APS Press, St. Paul MN, Sect. 
IV; Coligan, J.E. et al. (1997) Current Protocols in Tmmimnl^pY, Greene Pub. Associates and Wiley- 
Interscience, New York NY; Pound, J.D. (1998) Immunochemical Prnfrvnl.. Humana Press, Totowa 
NJ). 

A wide variety of labels and conjugation techniques are known by those skilled m the art and 
may be used in various nucleic acid and amino acid assays. Means for producing labeled 
hybridization or PCR probes for detecting sequences related to polynucleotides encoding CGDD 
include oligolabeling, nick translation, end-labeling, or PCR amplification using a labeled nucleotide. 
Alternatively, polynucleotides encoding CGDD, or any fragments thereof, may be cloned into a 
vector for the production of an mRNA probe. Such vectors are known in die art, are commercially 
available, and may be used to synthesize RNA probes in vitro by addition of an appropriate RNA 
polymerase such as T7, T3, or SP6 and labeled nucleotides. These procedures may be conducted 
using a variety of commercially available kits, such as those provided by Amersham Biosciences, 
Promega (Madison WI), and US Biochemical. Suitable reporter molecules or labels which may be 
used for ease of detection include radionuclides, enzymes, fluorescent, chemUuminescent, or 
chromogenic agents, as well as substrates, cofactors, inhibitors, magnetic particles, and the like. 

Host ceUs transformed with polynucleotides encodmg CGDD may be cultured under 
conditions suitable for the expression and recovery of tiie protein fi-om cell culture. The protein 
produced by a tiransformed cell may be secreted or retained intiracellularly depending on the sequence 
and/or the vector used. As will be understood by those of skUl in die art, expression vectors 
containing polynucleotides which encode CGDD may be designed to contain signal sequences which 
direct secretion of CGDD tijrough a prokaryotic or eukaryotic ceil membrane. 
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In addition, a host cell strain may be chosen for its ability to modulate expression of the 
inserted polynucleotides or to process the expressed protein in the desired fashion. Such 
modifications of the polypeptide include, but are not limited to, acetylation, carboxylation, 
glycosylation, phosphorylation, lipidation, and acylation. Post-translational processing which cleaves 
a "prepro" or "pro" form of the protein may also be used to specify protein targeting, folding, and/or 
activity. Different host cells which have specific cellular machinery and characteristic mechanisms 
for post-translational activities (e.g., CHO, HeLa, MDCK, HEK293, and WI38) are available from the 
American Type Culture Collection (ATCC, Manassas VA) and may be chosen to ensure the correct 
modification and processing of the foreign protein. 

In another embodiment of the invention, natural, modified, or recombmant polynucleotides 
encoding CGDD may be ligated to a heterologous sequence resulting in translation of a fusion protein 
in any of the aforementioned host systems. For example, a chimeric CGDD protein containing a 
heterologous moiety that can be recognized by a commercially available antibody may facilitate the 
screening of peptide libraries for inhibitors of CGDD activity. Heterologous protein and peptide 
moieties may also facilitate purification of fusion proteins using commercially available affmity 
matrices. Such moieties include, but are not limited to, glutathione S-transferase (GST), maltose 
bmdmg protein (MBP), thioredoxin (Trx), calmodulin binding peptide (CBP), 6-His, FLAG, c-myc, 
and hemagglutinin (HA). GST, MBP, Trx, CBP, and 6-His enable purificarion of their cognate fusion 
proteins on immobilized glutathione, maltose, phenylarsine oxide, cahnodulin, and metal-chelate 
resms, respectively. FLAG, c-myc, and hemagglutinin (HA) enable immunoaffinity purification of 
fusion protems using commercially available monoclonal and polyclonal antibodies that specifically 
recognize these epitope tags. A fusion protein may also be engineered to contam a proteolytic 
cleavage site located between the CGDD encoding sequence and the heterologous protein sequence, 
so that CGDD may be cleaved away from the heterologous moiety following pmification. Methods 
for fusion protein expression and purification are discussed in Ausubel et al. {supra, ch. 10 and 16). 
A variety of commercially available kits may also be used to facilitate expression and purification of 
fusion protems. 

In another embodunent, synthesis of radiolabeled CGDD may be achieved in vitro usmg the 
TNT rabbit reticulocyte lysate or wheat germ extract system (Promega). These systems couple 
transcription and translation of protein-coding sequences operably associated with the T7, T3, or SP6 
promoters. Translation takes place in the presence of a radiolabeled amino acid precursor, for 
example, ^'S-methionine. 

CGDD, fragments of CGDD, or variants of CGDD may be used to screen for compounds that 
specifically bind to CGDD. One or more test compounds may be screened for specific binding to 
CGDD. In various embodiments, 1, 2, 3, 4, 5, 10, 20. 50, 100, or 200 test compounds can be screened 
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for specific binding to CGDD. Examples of test compounds can include antibodies, anticalins, 
oligonucleotides, proteins (e.g., ligands or receptors), or small molecules. 

In related embodiments, variants of CGDD can be used to screen for binding of test 
compounds, such as antibodies, to CGDD, a variant of CGDD, or a combination of CGDD and/or one 
or more variants CGDD. In an embodiment, a variant of CGDD can be used to screen for compounds 
that bind to a variant of CGDD, but not to CGDD having the exact sequence of a sequence of SEQ ID 
NO: 1-33. CGDD variants used to perform such screening can have a range of about 50% to about 
99% sequence identity to CGDD, with various embodiments having 60%, 70%, 75%, 80%, 85%, 
90%, and 95% sequence identity. 

In an embodiment, a compound identified in a screen for specific binding to CGDD can be 
closely related to the natural ligand of CGDD, e.g., a hgand or firagment thereof, a natural substrate, a 
structural or functional mimetic, or a natural binding partner (Coligan, J.E. et al. (1991) Current 
Protocols in Immunolopr l(2):Chapter 5). In another embodiment, the compound thus identified can 
be a natural hgand of a receptor CGDD (Howard, A.D. et al. (2001) Trends Pharmacol. Sci.22:132- 
140; Wise, A. et al. (2002) Drug Discovery Today 7:235-246). 

In other embodunents, a compound identified m a screen for specific bindmg to CGDD can 
be closely related to the natural receptor to which CGDD binds, at least a fragment of the receptor, or 
a fragment of the receptor including aU or a portion of the hgand binding site or binding pocket. For 
example, the compound may be a receptor for CGDD which is capable of propagatmg a signal, or a 
decoy receptor for CGDD which is not capable of propagatmg a signal (Ashkenazi, A. and V.M. Divit 
(1999) Curr. 0pm. Cell Biol. 11:255-260; Mantovani, A. et al. (2001) Trends Immunol. 22:328-336). 
The compound can be rationally designed using known techniques. Examples of such techniques 
include those used to construct the compound etanercept (ENBREL; Amgen hic. Thousand Oaks 
CA), which is efficacious for treating rheumatoid arthritis in humans. Etanercept is an engineered 
p75 tumor necrosis factor (TNF) receptor duner linked to the Fc portion of human IgG , (Taylor, P.C. 
et al. (20O1) Curr. Opin. Immunol. 13:611-616). 

In one embodiment, two or more antibodies having similar or, alternatively, different 
specificities can be screened for specific binding to CGDD, fragments of CGDD, or variants of 
CGDD. The binding specificity of the antibodies thus screened can thereby be selected to identify 
particular fragments or variants of CGDD. In one embodiment, an antibody can be selected such that 
its binding specificity allows for preferential identification of specific fragments or variants of 
CGDD. In anotiier embodiment, an antibody can be selected such that its binding specificity allows 
for preferential diagnosis of a specific disease or condition having increased, decreased, or otherwise 
abnormal production of CGDD. 

hi an embodiment, anticalins can be screened for specific binding to CGDD, fragments of 
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CGDD, or variants of CGDD. Anticalins are ligand-binding proteins that have been constructed 
based on a lipocalin scaffold (Weiss, G.A. and H.B. Lowman (2000) Chem. Biol. 7;R177-R184; 
Sketra, A. (2001) J. Biotechnol. 74:257-275). The protein architecture of lipocalins can include a 
beta-barrel havmg eight antiparallel beta-strands, which supports four loops at its open end. These 
5 loops fonn the natural ligand-binding site of the lipocalins, a site which can be re-engineered in vitro 
by amino acid substitutions to impart novel binding specificities. The amino acid substitutions can be 
made using methods known in the art or described herein, and can include conservative substitutions 
(e.g., substitutions that do not alter bmding specificity) or substitutions tiiat modestly, moderately, or 
significantly alter blading specificity. 

10 hi one embodiment, screening for compounds which specifically bind to, stimulate, or inhibit 

CGDD mvolves producing appropriate cells which express CGDD, either as a secreted protein or on 
the ceU membrane. Preferred cells can include ceUs from mammals, yeast, Drosophila, orK coli. 
Cells expressing CGDD or cell membrane fractions which contain CGDD are then contacted witii a 
test compound and binding, stimulation, or inhibition of activity of eitiier CGDD or the compound is 

15 analyzed. 

An assay may srniply test bindmg of a test compound to the polypeptide, wherein binding is 
detected by a fluorophore, radioisotope, enzyme conjugate, or otiier detectable label. For exairq,le, 
the assay may comprise die steps of combining at least one test compound with CGDD, eitiier in 
solution or affixed to a soUd support, and detecting the binding of CGDD to die compound. 
Alternatively, die assay may detect or measure binding of a test compound m the presence of a 
labeled competitor. Additionally, the assay may be carried out using ceU-free preparations, chemical 
Ubraries, or natural product mixtiires. and tiie test compound(s) may be free in solution or affixed to a 
sohd support. 

An assay can be used to assess the ability of a compound to bmd to its natural ligand and/or 
25 to inhibit die binding of its natural ligand to its natural receptors. Examples of such assays include 
radio-labeling assays such as diose described in U.S. Patent No. 5,914,236 and U.S. Patent No. 
6,372,724. hi a related embodiment, one or more amino acid substitutions can be introduced into a 
polypeptide compound (such as a receptor) to improve or alter its ability to bind to its natural ligands 
(Mattiiews, D.J. and J.A. Wells. (1994) Chem. Biol. 1:25-30). In anotiier related embodmient, one or 
more ammo acid substitutions can be introduced into a polypeptide compound (such as a hgand) to 
improve or alter its ability to bind to its natural receptors (Cunningham, B.C. and J.A. Wells (1991) 
Proc. Nati. Acad. Sci. USA 88:3407-341 1; Lowman, H.B. et al. (1991) J. Biol. Chem. 266: 10982- 
10988). 

CGDD, fragments of CGDD, or variants of CGDD may be used to screen for compounds diat 
modulate the activity of CGDD. Such compounds may include agonists, antagonists, or partial or 
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inverse agonists. In one embodiment, an assay is performed under conditions permissive for CGDD 
activity, wherein CGDD is combined with at least one test compound, and the activity of CGDD in 
the presence of a test compound is compared with the activity of CGDD in the absence of the test 
compound. A change m the activity of CGDD in the presence of the test compound is indicative of a 
5 compound that modulates the activity of CGDD. Alternatively, a test compound is combined with an 
in vitro or cell-free system comprising CGDD under conditions suitable for CGDD activity, and the 
assay is perfonned. In either of these assays, a test compound which modulates the activity' of CGDD 
may do so indkectly and need not come in direct contact with the test compound. At least one and up 
to a plurality of test compounds may be screened. 
10 hi another embodhnent. polynucleotides encoding CGDD or their mammalian homologs may 

be "knocked out" in an animal model system using homologous recombmation in embryonic stem 
(ES) cells. Such techniques are well known in the art and are useful for the generation of animal 
models of human disease (see, e.g., U.S. Patent No. 5,175,383 and U.S. Patent No. 5,767,337). For 
example, mouse ES ceUs, such as the mouse 129/SvJ cell Ime, are derived from the early mouse 
15 embryo and grown in culture. The ES cells are transformed with a vector containmg the gene of 

interest disrupted by a marker gene, e.g., the neomycin phosphotransferase gene Qteo; Capecchi. M R 
(1989) Science 244:1288-1292). The vector integrates into the corresponding region of the host 
genome by homologous recombination. Alternatively, homologous recombmation takes place using 
the Cre-loxP systena to knockout a gene of interest in a tissue- or developmental stage-specific mamier 
(Marth. J.D. (1996) Clin. Invest 97:1999-2002; Wagner, K.U. et al. (1997) Nucleic Acids Res. 
25:4323-4330). Transformed ES cells are identified and microinjected mto mouse cell blastocysts 
such as those from the C57Biy6 mouse strain. The blastocysts are surgically transferred to 
pseudopregnant dams, and the resulting chimeric progeny are genotyped and bred to produce 
heterozygous or homozygous strains. Transgenic animals thus generated may be tested with potential 
25 therapeutic or toxic agents. 

Polynucleotides encoding CGDD may also be manipulated in vitro in ES ceUs derived from 
human blastocysts. Human ES cells have the potential to differentiate into at least eight separate cell 
lineages includmg endoderm, mesoderm, and ectodermal cell types. These cell lineages differentiate 
into, for exanqjle, neural cells, hematopoietic lineages, and cardiomyocytes (Thomson, J.A. et al. 
30 (1998) Science 282: 1 145-1 147). 

Polynucleotides encoding CGDD can also be used to create ' Wkin" humanized animals 
(pigs) or transgenic animals (mice or rats) to model hmnan disease. With knockm technology, a 
region of a polynucleotide encodmg CGDD is injected into animal ES cells, and die injected sequence 
integrates into the animal cell genome. Transfonned ceUs are injected mto blastulae, and the 
blastulae are unplanted as described above. Transgenic progeny or inbred lines are studied and 
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treated with potential pharmaceutical agents to obtain information on treatment of a human disease. 
Alternatively, a mammal inbred to overexpress CGDD, e.g., by secreting CGDD in its milk, may also 
serve as a convenient source of that protein (Janne. J. et al. (1998) Biotechnol. Annu. Rev. 4:55-74) 
THERAPEUTICS 

Chemical and structural similarity, e.g., in die context of sequences and motifs, exists 
between regions of CGDD and proteins associated with cell growth, differentiation, and death. In 
addition, examples of tissues expressing CGDD can be found in Table 6 and can also be found in 
Example XI. Therefore. CGDD appears to play a K>le in ceU proliferative disorders including cancer, 
developmental disorders, neurological disorders, autoimmune/inflammatory disorders, reproductive 
disorders, and disorders of the placenta. In the treatment of disorders associated with increased 
CGDD expression or activity, it is desirable to decrease the expression or activity of CGDD. In the 
treatment of disorders associated with decreased CGDD expression or activity, it is desirable to 
increase the expression or activity of CGDD. 

Therefore, in one embodiment, tGDD or a fragment or derivative thereof may be 
15 administered to a subject to treat or prevent a disorder associated with decreased expression or 
activity of CGDD. Examples of such disorders include, but are not limited to, a cell proliferative 
disorder such as actinic keratosis, arteriosclerosis, atherosclerosis, bursitis, cirrhosis, hepatitis, mixed 
connective tissue disease (MCTD), myelofibrosis, paroxysmal nocturnal hemoglobinuria, 
polycythemia vera, psoriasis, primary thrombocythemia, and cancers including adenocarcinoma, 
20 leukemia, lymphoma, melanoma, myeloma, sarcoma, teratocarcinoma, and, in particular, cancers of 
the adrenal gland, bladder, bone, bone marrow, brain, breast, cervix, colon, gall bladder, ganglia, 
gastrointestinal tract, heart, kidney, liver, lung, muscle, ovary, pancreas, parathyroid, perns, prosiate. 
salivary glands, skin, spleen, testis, thymus, thyroid, and uterus; a developmental disorder such as 
renal tubular acidosis, anemia, Cushing's syndrome, achondroplastic dwarfism, Duchenne and Becker 
muscular dystrophy, epilepsy, gonadal dysgenesis, WAGR syndrome (Wilms' tumor, aniridia, 
genitourinary abnormalities, and mental retardation), Smith-Magenis syndrome, myelodysplastic 
syndrome, hereditary mucoepithelial dysplasia, hereditary keratodermas, hereditary neuropathies such 
as Charcot-Marie-Tooth disease and neurofibromatosis, hypothyroidism, hydrocephalus, seizure 
disorders such as Syndenham's chorea and cerebral palsy, spma bifida, anencephaly, 
30 craniorachischisis, congenital glaucoma, cataract, and sensorineural hearing loss; a neurological 

disorder such as epilepsy, ischemic cerebrovascular disease, stroke, cerebral neoplasms, Alzheimer's 
disease. Pick's disease, Hmitington's disease, dementia, Parkinson's disease and other extrapyramidal 
disorders, amyotrophic lateral sclerosis and other motor neuron disorders, progressive neural 
muscular atrophy, retinitis pigmentosa, hereditary ataxias, multiple sclerosis and other demyelinating 
diseases, bacterial and viral meningitis, brain abscess, subdural empyema, epidural abscess. 
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suppurative intracranial thrombophlebitis, myelitis and radiculitis, viral central nervous system 
disease, prion diseases including kuru, Creutzfeldt-Jakob disease, and Gerstmann- 
Straussler-Scheinker syndrome, fatal familial insomnia, nutritional and metabolic diseases of the 
nervous system, neurofibromatosis, tuberous sclerosis, cerebelloretinal hemangioblastomatosis, 
5 encephalotrigeminal syndrome, mental retardation and other developmental disorders of the central 
nervous system including Down syndrome, cerebral palsy, neuroskeletal disorders, autonomic 
nervous system disorders, cranial nerve disorders, spinal cord diseases, muscular dystrophy and other 
neuromuscular disorders, peripheral nervous system disorders, dermatomyositis and polymyositis, 
inherited, metabolic, endocrine, and toxic myopathies, myasthenia gravis, periodic paralysis, mental 
10 disorders including mood, anxiety, and schizophrenic disorders, seasonal affective disorder (SAD), 
akathesia, amnesia, catatonia, diabetic neuropathy, tardive dyskinesia, dystonias, paranoid psychoses, 
postherpetic neuralgia, Tourette's disorder, progressive supranuclear palsy, corticobasal degeneration, 
and familial frontotemporal dementia; an autoimmune/inflarDmatory disorder such as acquired 
immunodeficiency syndrome (AIDS), Addison's disease, aduh respiratory distress syndrome, 
15 allergies, ankylosing spondylitis, amyloidosis, anemia, asthma, atherosclerosis, autoimmune 

hemolytic anemia, autoimmune thyroiditis, autoimmune polyendocrinopathy-candidiasis-ectodermal 
dystrophy (APECED), bronchitis, cholecystitis, contact dermatitis, Crohn's disease, atopic dermatitis, 
dermatomyositis, diabetes mellitus, emphysema, episodic lymphopenia with lymphocytotoxins, 
erythroblastosis fetalis, erythema nodosum, atrophic gastritis, glomerulonephritis, Goodpasture's 
20 syndrome, gout. Graves' disease, Hashimoto's thyroiditis, hypereosinophilia, irritable bowel 
syndrome, multiple sclerosis, myasthenia gravis, myocardial or pericardial inflammation, 
osteoarthritis, osteoporosis, pancreatitis, polymyositis, psoriasis, Reiter's syndrorce, rheumatoid 
arthritis, scleroderma, SjSgren's syndrome, systemic anaphylaxis, systemic lupus erythematosus, 
systemic sclerosis, thrombocytopenic purpura, ulcerative colitis, uveitis, Werner syndrome, 
25 con^lications of cancer, hemodialysis, and extracorporeal circulation, viral, bacterial, fungal, 

parasitic, protozoal, and hebninthic infections, and trauma; a reproductive disorder such as a disorder 
of prolactin production, infertility, including tubal disease, ovulatory defects, endometriosis, a 
disruption of the estrous cycle, a disruption of the menstrual cycle, polycystic ovary syndrome, 
ovarian hyperstimulation syndrome, an endometrial or ovarian tumor, a uterine fibroid, autoimmune 
30 disorders, ectopic pregnancy, teratogenesis; cancer of the breast, fibrocystic breast disease, 

galactorrhea; a disruption of spermatogenesis, abnormal sperm physiology, cancer of the testis, cancer 
of the prostate, benign prostatic hyperplasia, prostatitis, Peyronie's disease, impotence, carcinoma of 
the male breast, gynecomastia, hypergonadotropic and hypogonadotropic hypogonadism, 
pseudohermaphroditism, azoospermia, premature ovarian failure, acrosin deficiency, delayed puperty, 
35 retrograde ejaculation and anejaculation, haemangioblastomas, cystsphaeochromocytomas, 
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paraganglioma, cystadenomas of the epididymis, and endolymphatic sac tumors; and a disorder of the 
placenta such as preeclampsia, choriocarcinoma, abruptio placentae, placenta previa, placental or 
maternal floor infarction, placenta accreta, increate, and percreta, extrachorial placentas, 
chorangioma, chorangiosis, chronic villitis, placental villous endema, widespread fibrosis of the 
5 terminal villi, intervillous thrombi, hemorraghic endovasculitis, erythroblastosis fetalis, and 
nonimmune fetal hydrops. 

In another embodiment, a vector capable of expressing CGDD or a fragment or derivative 
thereof may be administered to a subject to treat or prevent a disorder associated with decreased 
expression or activity of CGDD including, but not limited to, those described above. 
10 In a further embodiment, a composition comprising a substantially purified CGDD in 

conjunction with a suitable pharmaceutical carrier may be administered to a subject to treat or prevent 
a disorder associated with decreased expression or activity of CGDD including, but not limited to, 
those provided above. 

In still another embodiment, an agonist which modulates the activity of CGDD may be 
15 administered to a subject to treat or prevent a disorder associated with decreased expression or 
activity of CGDD including, but not limited to, those listed above. 

In a further embodiment, an antagonist of CGDD may be administered to a subject to treat or 
prevent a disorder associated with increased expression or activity of CGDD. Examples of such 
disorders mclude, but are not limited to, those cell proliferative disorders includmg cancer, 
20 developmental disorders, neurological disorders, autoimmune/inflammatory disorders, reproductive 
disorders, and disorders of the placenta described above. In one aspect, an antibody which 
specifically binds CGDD may be used directly as an antagonist or indirectly as a targeting or delivery 
mechanism for bringing a pharmaceutical agent to cells or tissues which express CGDD. 

In an additional embodiment, a vector expressing the complement of the polynucleotide 
25 encoding CGDD may be administered to a subject to treat or prevent a disorder associated with 
increased expression or activity of CGDD including, but not limited to, those described above. 

In other embodiments, any protein, agonist, antagonist, antibody, complementary sequence, or 
vector embodiments may be administered in combination with other appropriate therapeutic agents. 
Selection of the appropriate agents for use in combination therapy may be made by one of ordinary 
30 skill in the art, according to conventional pharmaceutical principles. The combination of therapeutic 
agents may act synergistically to effect the treatment or prevention of the various disorders described 
above. Using this approach, one may be able to achieve therapeutic efficacy with lower dosages of 
each agent, thus reducing the potential for adverse side effects. 

An antagonist of CGDD may be produced using methods which are generally known in the 
35 art. In particular, purified CGDD may be used to produce antibodies or to screen libraries of 
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pharmaceutical agents to identify those which specifically bind CGDD. Antibodies to CGDD may 
also be generated using methods that are well known in the art. Such antibodies may include, but are 
not limited to, polyclonal, monoclonal, chimeric, and single chain antibodies, Fab fragments, and 
fragments produced by a Fab expression library. In an embodiment, neutralizing antibodies (i.e., 
5 those which inhibit dimer formation) can be used therapeutically. Single chain antibodies (e.g., from 
camels or llamas) may be potent enzyme inhibitors and may have application m the design of peptide 
mimetics, and in the development of immuno-adsorbents and biosensors (Muyldermans, S. (2001) J. 
Biotechnol. 74:277-302). 

For the production of antibodies, various hosts including goats, rabbits, rats, mice, camels, 

10 dromedaries, llamas, humans, and others may be immunized by injection with CGDD or with any 
fragment or oligopeptide thereof which has immunogenic properties. Depending on the host species, 
various adjuvants may be used to increase immunological response. Such adjuvants include, but are 
not limited to, Freund's, mineral gels such as aluminum hydroxide, and surface active substances such 
as lysolecithin, pluronic polyols, polyanions, peptides, oU emulsions, KLH, and dinitrophenol. 

15 Among adjuvants used in humans, BCG (bacilU Calmette-Guerin) and Corynebacterium parvum are 
especially preferable. 

It is preferred that die oligopeptides, peptides, or fragments used to induce antibodies to 
CGDD have an amino acid sequence consisting of at least about 5 amino acids, and generally will 
consist of at least about 10 amino acids. It is also preferable that these oligopeptides, peptides, or 

20 fragments are substantially identical to a portion of the amino acid sequence of the natural protein. 
Short stretches of CGDD amino acids may be fused with those of another protein, such as KLH, and 
antibodies to the chimeric molecule may be produced. 

Monoclonal antibodies to CGDD may be prepared using any technique which provides for the 
production of antibody molecules by continuous cell lines in culture. These include, but are not 

25 limited to, the hybridoma technique, die human B-cell hybridoma technique, and die EBV-hybridoma 
technique (Kohler, G. et al. (1975) Namre 256:495-497; Kozbor, D. et al. (1985) J. Immunol. 
Methods 81:31-42; Cote, R.J. et al. (1983) Proc. Nad. Acad. Sci. USA 80:2026-2030; Cole, S.P. et al. 
(1984) Mol. CeU Biol. 62: 109-120). 

In addition, techniques developed for the production of "chimeric antibodies," such as the 

30 splicing of mouse antibody genes to human antibody genes to obtain a molecule with appropriate 
antigen specificity and biological activity, can be used (Morrison, S.L. et al. (1984) Proc. Natl. Acad. 
Sci. USA 81:6851-6855; Neuberger, M.S. et al. (1984) Nature 312:604-608; Takeda, S. et al. (1985) 
Nature 314:452-454). Alternatively, techniques described for the production of single chain 
antibodies may be adapted, using methods known in the art, to produce CGDD-specific single chain 

35 antibodies. Antibodies with related specificity, but of distinct idiotypic composition, may be 
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generated by chain shuffling from random combinatorial inmiunoglobulin libraries (Burton, D.R. 
(1991) Proc. Nati. Acad. Sci. USA 88:10134-10137). 

Antibodies may also be produced by inducing m vivo production in the lymphocyte 
population or by screemng immunoglobulin libraries or panels of highly specific binding reagents as 
disclosed in the literature (Orlandi, R. et al. (1989) Proc. Natl. Acad. Sci. USA 86:3833-3837; Winter, 
G. et al. (1991) Nature 349:293-299). 

Antibody fragments which contain specific binding sites for CGDD may also be generated. 
For example, such fragments mclude, but are not limited to, F(ab), fragments produced by pepsin 
digestion of the antibody molecule and Fab fragments generated by reducing the disulfide bridges of 
the F(ab)2 fragments. Alternatively, Fab expression libraries may be constructed to allow rapid and 
easy identification of monoclonal Fab fragments with the desired specificity (Huse, W.D. et al. (1989) 
Science 246:1275-1281). 

Various immunoassays may be used for screening to identify antibodies having the desired 
specificity. Numerous protocols for competitive binding or immunoradiometric assays using either 
15 polyclonal or monoclonal antibodies with estabUshed specificities are weU known in the art. Such 
immunoassays typically involve the measurement of complex formation between CGDD and its 
specific antibody. A two-site, monoclonal-based immunoassay utilizing monoclonal antibodies 
reactive to two non-interfering CGDD epitopes is generally used, but a competitive binding assay may 
also be employed (Pound, supra). 
20 Various methods such as Scatchard analysis in conjunction with radioimmunoassay 

techniques may be used to assess the affinity of antibodies for CGDD. Affinity is expressed as an 
association constant, K., which is defined as the molar concentration of CGDD-antibody complex 
divided by the molar concentrations of fiee antigen and fiee antibody under equiUbrium conditions. 
The K. determined for a preparation of polyclonal antibodies, which are heterogeneous in theu- 
25 affmities for multiple CGDD epitopes, represents the average affinity, or avidity, of the antibodies for 
CGDD. The K. determined for a preparation of monoclonal antibodies, which are monospecific for a 
particular CGDD epitope, represents a true measure of affinity. High-affmity antibody preparations 
with K3 ranging from about 10' to 10'^ L/mole are preferred for use in immunoassays in which the 
CGDD-antibody complex must withstand rigorous manipulations. Low-affinity antibody preparations 
30 with K. ranging from about 10'' to 10^ L/mole are preferred for use in immunopurification and sunilar 
procedures which ultimately require dissociation of CGDD. preferably in active form, from the 
antibody (Catty, D. (1988) Antibodies. Volume I: A Practic.1 Apprn..h irl Press, Washington DC; 
Liddell, J.E. and A. Oyer (1991) A Practical CniidMo Monoclonal AntihnH,Vc John Wiley & Sons, 
New York NY). 

35 The titer and avidity of polyclonal antibody preparations may be further evaluated to 
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determine the quality and suitability of such preparations for certain downstream applications. For 
example, a polyclonal antibody preparation containing at least 1-2 mg specific antibody/ml, 
preferably 5-10 mg specific antibody/ml, is generally employed in procedures requiring precipitation 
of CGDD-antibody complexes. Procedures for evaluating antibody specificity, titer, and avidity, and 
5 guidelines for antibody quality and usage in various applications, are generally available (Catty, 
supra; Coligan et al., supra). 

In another embodiment of the invention, polynucleotides encoding CGDD, or any fragment or 
complement thereof, may be used for therapeutic purposes. In one aspect, modifications of gene 
expression can be achieved by designing complementary sequences or antisense molecules (DNA, 
10 RNA, PNA, or modified oligonucleotides) to the coding or regulatory regions of the gene encoding 
CGDD. Such technology is well known in the art, and antisense oligonucleotides or larger fragments 
can be designed fi-om various locations along the coding or control regions of sequences encoding 
CGDD (Agrawal, S., ed. (1996) Antisense Therap eutics. Humana Press, Totawa NJ). 

In therapeutic use, any gene delivery system suitable for introduction of the antisense 
15 sequences into appropriate target cells can be used. Antisense sequences can be delivered 

intraceUularly in the form of an expression plasmid which, upon transcription, produces a sequence 
complementary to at least a portion of the cellular sequence encoding the target protein (Slater, J.E. et 
al. (1998) J. AUergy Clin. Immunol. 102:469-475; Scanlon, K.J. et al. (1995) 9:1288-1296). 
Antisense sequences can also be introduced intraceUularly through the use of viral vectors, such as 
20 retrovirus and adeno-associated virus vectors (Miller, A.D. (1990) Blood 76:271; Ausubel et al., 
supra; Uckert, W. and W. Walther (1994) Pharmacol. Ther. 63:323-347). Other gene delivery 
mechanisms include liposome-derived systems, artificial viral envelopes, and other systems known in 
the art (Rossi, J.J. (1995) Br. Med. Bull. 51:217-225; Boado, R.J. et al. (1998) J. Phann. Sci. 87:1308- 
1315; Morris, M.C. et al. (1997) Nucleic Acids Res. 25:2730-2736). 
25 In mother embodiment of the invention, polynucleotides encoding CGDD may be used for 

somatic or germline gene therapy. Gene therapy may be perfomed to (i) correct a genetic deficiency 
(e.g., in the cases of severe combined immunodeficiency (SCID)-Xl disease characterized by X- 
linked inheritance (Cavazzana-Calvo, M. et al. (2000) Science 288:669-672), severe combined 
immunodeficiency syndrome associated with an mherited adenosine deaminase (ADA) deficiency 
30 (Blaese, R.M. et al. (1995) Science 270:475-480; Bordignon, C. et al. (1995) Science 270:470-475), 
cystic fibrosis (Zabner, J. et al. (1993) CeU 75:207-216; Crystal, R.G. et al. (1995) Hum. Gene 
Therapy 6:643-666; Crystal. R.G. et al. (1995) Hum. Gene Therapy 6:667-703), thalassamias, famiUal 
hypercholesterolemia, and hemophilia resulting fi-om Factor Vm or Factor K deficiencies (Crystal, 
R.G. (1995) Science 270:404-410; Veima, I.M. and N. Somia (1997) Nature 389:239-242)), (ii) 
35 express a conditionally lethal gene product (e.g., m the case of cancers which result from unregulated 
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cell proliferation), or (iii) express a protein which afifords protection against intracellular parasites 
(e.g., against human retroviruses, such as human immunodeficiency virus (HIV) (Baltimore, D. 
(1988) Nature 335:395-396; Poeschla, E. et al. (1996) Proc. Natl. Acad. Sci. USA 93:11395-11399), 
hepatitis B or C vkus (HBV, HCV); fungal parasites, such as Candida albicans and Paracoccidioides 
brasiliensis; and protozoan parasites such as Plasmodium falciparum and Trypanosoma cnizi). hi the 
case where a genetic deficiency in CGDD expression or regulation causes disease, the expression of 
CGDD from an appropriate population of transduced cells may alleviate the cUnical manifestations 
caused by the genetic deficiency. 

In a further embodiment of die invention, diseases or disorders caused by deficiencies m 
CGDD are treated by constructing mammalian expression vectors encodmg CGDD and mtroducing 
these vectors by mechanical means into CGDD-deficient cells. Mechanical transfer technologies for 
use with cells in vivo or ex vitro mclude (i) direct DNA micromjection into individual cells, (ii) 
baUistic gold particle delivery, (iii) Uposome-mediated transfection, (iv) receptor-mediated gene 
transfer, and (v) the use of DNA transposons (Morgan, R.A. and W.F. Anderson (1993) Annu. Rev. 
15 Biochem. 62: 191-217; Ivies, Z. (1997) Cell 91:501-5 10; Boulay, J.-L. and H. R^cipon (1998) Cuir. 
Opin. Biotechnol. 9:445-450). 

Expression vectors that may be effective for the expression of CGDD mclude, but are not 
limited to, the PCDNA 3.1, EPITAG, PRCCMV2, PREP, PVAX, PCR2-TOPOTA vectors 
(Invitrogen, Carlsbad CA), PCMV-SCRIPT, PCMV-TAG, PEGSH«>ERV (Stratagene, La Jolla CA), 
20 and PTET-OFF, PTET-ON, PTRE2, PTRE2-LUC, PTK-HYG (Qontech, Palo Alto CA). CGDD may 
be expressed usmg (i) a constitutively active promoter, (e.g., from cytomegalovkus (CMV), Rous 
sarcoma virus (RSV), SV40 vmis, thymidine kinase (TK), or P-actm genes), (ii) an mducible 
promoter (e.g., the tetracycline-regulated promoter (Gossen, M. and H. Bujard (1992) Proc. Natl. 
Acad. Sci. USA 89:5547-5551; Gossen, M. et al. (1995) Science 268:1766-1769; Rossi, F.M.V. and 
H.M. Blau (1998) Curr. Opin. Biotechnol. 9:451^56), commercially available m the T-REX plasmid 
(Invitrogen)); the ecdysone-inducible promoter (available m die plasmids PVGRXR and FIND; 
Invitrogen); the FK506/rapamycin inducible promoter; or the RU486/mifepristone inducible promoter 
(Rossi, F.M.V. and H.M. Blau, supra)), or (iii) a tissue-specific promoter or the native promoter of 
the endogenous gene encoding CGDD from a normal individual. 
30 Commercially available liposome transformation Mts (e.g., the PERFECT LIPID 

TRANSFECTION KIT, available from Invitrogen) allow one with ordinary skill in the art to deliver 
polynucleotides to target cells in culture and require minimal effort to optunize experunental 
parameters. In the alternative, transfomiation is performed using the calcium phosphate method 
(Graham, F.L. and A.J. Eb (1973) Virology 52:456-467), or by electroporation (Neumann, E. et al. 
35 ( 1982) EMBO J . 1 :841-845). The introduction of DNA to primary cells requures modification of 
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these standardized mammalian transfection protocols. 

In another embodiment of the invention, diseases or disorders caused by genetic defects with 
respect to CGDD expression are treated by constructing a retrovirus vector consisting of (i) the 
polynucleotide encoding CGDD under the control of an independent promoter or the retrovirus long 
5 terminal repeat (LTR) promoter, (ii) appropriate RNA packaging signals, and (iii) a Rev-responsive 
element (RRE) along with additional retrovmis cw-acting RNA sequences and coding sequences 
required for efficient vector propagation. Retrovmis vectors (e.g., PFB and PFBNEO) are 
commercially available (Stratagene) and are based on pubUshed data (Riviere, I. et al. (1995) Proc. 
Natl. Acad. Sci. USA 92:6733-6737), incorporated by reference herem. The vector is propagated in 
10 an appropriate vector producing cell Une (VPCL) that expresses an envelope gene with a tropism for 
receptors on the target cells or a promiscuous envelope protem such as VSVg (Armentano, D. et al. 
(1987) J. Virol. 61:1647-1650; Bender, M.A. et al. (1987) J. Virol. 61:1639-1646; Adam, M.A. and 
A.D. Miller (1988) J. Vkol. 62:3802-3806; Dull, T. et al. (1998) J. Virol. 72:8463-8471; Zufferey, R. 
et al. (1998) J. Virol. 72:9873-9880). U.S. Patent No. 5,910,434 to Rigg ("Method for obtaining 
15 retrovirus packaging cell lines producing high transducing efficiency refroviral supernatant") 
discloses a method for obtainmg retrovirus packaging cell Imes and is hereby incorporated by 
reference. Propagation of retrovmis vectors, transduction of a population of cells (e.g., CD4* T- 
cells), and the return of transduced cells to a patient are procedures well known to persons skilled m 
the art of gene therapy and have been well documented (Ranga, U. et al. (1997) J. Virol. 71:7020- 
7029; Bauer, G. et al. (1997) Blood 89:2259-2267; Bonyhadi, M.L. (1997) J. Virol. 71:4707-4716; 
Ranga, U. et al. (1998) Proc. Natl. Acad. Sci. USA 95:1201-1206; Su, L. (1997) Blood 89:2283- 
2290). 

Ill an embodiment, an adenovirus-based gene therapy delivery system is used to deliver 
polynucleotides encoding CGDD to cells which have one or more genetic abnormalities with respect 
to the expression of CGDD. The construction and packaging of adenovirus-based vectors are well 
known to those with ordinary skill in the art. Replication defective adenovmis vectors have proven to 
be versatile for importing genes encoding immunoregulatory protems into intact islets in the pancreas 
(Csete. M.E. et al. (1995) Transplantation 27:263-268). Potentially useM adenovu-al vectors are 
described in U.S. Patent No. 5,707,618 to Armentano ("Adenovirus vectors for gene therapy"), 
hereby incorporated by reference. For adenovkal vectors, see also Antinozzi, P.A. et al. (1999; Annu. 
Rev. Nutr. 19:511-544) and Verma, I.M. and N. Somia (1997; Nature 18:389:239-242). 

In another embodiment, a herpes-based, gene therapy delivery system is used to deliver 
polynucleotides encoding CGDD to target cells which have one or more genetic abnormalities with 
respect to the expression of CGDD. The use of herpes simplex virus (HSV)-based vectors may be 
especially valuable for introducing CGDD to cells of the central nervous system, for which HS V has a 
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tropism. The construction and packaging of herpes-based vectors are well known to those with 
ordinary skill in the art. A replication-competent herpes simplex virus (HSV) type 1-based vector has 
been used to deliver a reporter gene to the eyes of primates (Liu, X. et al. (1999) Exp. Eye Res. 
169:385-395). The construction of a HSV-i virus vector has also been disclosed in detail in U.S. 
Patent No. 5,804,413 to DeLuca ("Herpes simplex virus strains for gene transfer"), which is hereby 
incorporated by reference. U.S. Patent No. 5,804,413 teaches the use of recombinant HSV d92 which 
consists of a genome containing at least one exogenous gene to be transferred to a cell under the 
control of the appropriate promoter for purposes including human gene therapy. Also taught by this 
patent are the construction and use of recombinant HSV strains deleted for ICP4, ICP27 and ICP22. 
For HSV vectors, see also Goms, W.F. et al. (1999; J. Virol. 73:519-532) and Xu, H. et al. (1994; 
Dev. Biol. 163: 152-161). The manipulation of cloned herpesvirus sequences, the generation of 
recombmant virus following the transfection of multiple plasmids containing different segments of 
the large herpesvirus genomes, the growth and propagation of herpesvirus, and the infection of cells 
with herpesvirus are techniques well known to those of ordinary skill in the art. 

hi another embodiment, an alphavirus (positive, single-stranded RNA virus) vector is used to 
deliver polynucleotides encodmg CGDD to target cells. The biology of the prototypic alphavirus, 
Semliki Forest Virus (SFV), has been studied extensively and gene transfer vectors have been based 
on the SFV genome (Garoff, H. and K.-J. Li (1998) Curr. Opin. Biotechnol. 9:464-469). During 
alphavirus RNA replication, a subgenomic RNA is generated that normally encodes the viral capsid 
proteins. This subgenomic RNA replicates to higher levels than the full length genomic RNA, 
resulting in the overproduction of capsid proteins relative to the viral protems with enzymatic activity 
(e.g., protease and polymerase). Similarly, msertmg the coding sequence for CGDD into the 
alphavirus genome m place of the capsid-coding region results in the production of a large number of 
CGDD-coding RNAs and the synthesis of high levels of CGDD in vector transduced cells. While 
alphavirus infection is tjpically associated with cell lysis within a few days, the ability to establish a 
persistent infection in hamster normal kidney cells (BHK-21) with a variant of Sindbis vims (SIN) 
indicates that the lytic replication of alphaviruses can be altered to suit the needs of the gene therapy 
appUcation (Dryga, S.A. et al. (1997) Virology 228:74-83). The wide host range of alphaviruses will 
allow the introduction of CGDD into a variety of cell types. The specific transduction of a subset of 
cells in a population may require the sorting of cells prior to transduction. The methods of 
manipulating infectious cDNA clones of alphaviruses, performing alphavirus cDNA and RNA 
transfections, and performing alphavirus infections, are weU known to those with ordinary skill m the 
art. 

Oligonucleotides derived from the transcription initiation site, e.g., between about positions 
-10 and +10 from the start site, may also be employed to inhibit gene expression. Similarly, mhibition 
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can be achieved using triple helix base-pairing methodology. Triple helix pairing is useful because it 
causes inhibition of the ability of the double helix to open sufficiently for the binding of polymerases, 
transcription factors, or regulatory molecules. Recent therapeutic advances using triplex DNA have 
been described in the literature (Gee, J.E. et al. (1994) in Huber, B.E. and B.I. Cair, Molecular and 
5 Immunologic Approaches, Futura Publishing, Mt. Kisco NY, pp. 163-177). A complementary 

sequence or antisense molecule may also be designed to block translation of mRNA by preventing the 
transcript from binding to ribosomes. 

Ribozymes, enzymatic RNA molecules, may also be used to catalyze the specific cleavage of 
RNA. The mechanism of ribozyme action involves sequence-specific hybridization of the ribozyme 
10 molecule to complementary target RNA, followed by endonucleolytic cleavage. For exan^Ie, 
engineered hammerhead motif ribozyme molecules may specifically and efficienUy catalyze 
endonucleolytic cleavage of RNA molecules encoding CGDD. 

Specific ribozyme cleavage sites within any potential RNA target are initially identified by 
scanning the target molecule for ribozyme cleavage sites, including the foUowing sequences: QUA, 
15 GUU, and GUC. Once identified, short RNA sequences of between 15 and 20 ribonucleotides, 
corresponding to the region of the target gene containing the cleavage site, may be evaluated for 
secondary structural features which may render the oligonucleotide inoperable. The suitability of 
candidate targets may also be evaluated by testing accessibility to hybridization with complementary 
oligonucleotides using ribonuclease protection assays. 
20 Complementary ribonucleic acid molecules and ribozymes may be prepared by any method 

known in the art for the synthesis of nucleic acid molecules. These include techniques for chemicaUy 
synthesizing oligonucleotides such as solid phase phosphoramidite chemical synthesis. Alternatively, 
RNA molecules may be generated by in vitro and in vivo transcription of DNA molecules encoding 
CGDD. Such DNA sequences may be incorporated into a wide variety of vectors with suitable RNA 
25 polymerase promoters such as T7 or SP6. Alternatively, these cDNA constructs that synthesize 
complementary RNA, constitutively or inducibly, can be introduced into cell lines, cells, or tissues. 

RNA molecules may be modified to increase intracellular stability and half-life. Possible 
modifications include, but are not limited to, the addition of flanking sequences at the 5' and/or 3' 
ends of the molecule, or the use of phosphorothioate or 2' 0-methyl rather than phosphodiesterase 
30 linkages within the backbone of the molecule. This concept is inherent in the production of PNAs 
and can be extended in all of these molecules by the mclusion of nontraditional bases such as inosine, 
queosine, and wybutosme, as well as acetyl-, methyl-, thio-, and similarly modified forms of adenine, 
cytidine, guanine, thymme, and uridine which are not as easily recognized by endogenous 
endonucleases. 

35 In other embodiments of the invention, the expression of one or more selected 
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polynucleotides of the present invention can be altered, inhibited, decreased, or silenced using RNA 
interference (RNAi) or post-transcriptional gene silencing (PTGS) methods known in the art. RNAi 
is a post-transcriptional mode of gene silencing in which double-stranded RNA (dsRNA) introduced 
into a targeted cell specifically suppresses the expression of the homologous gene (i.e., the gene 
5 bearing the sequence complementary to the dsRNA). This effectively knocks out or substantially 
reduces the expression of the targeted gene. PTGS can also be accomplished by use of DNA or DNA 
fragments as well. RNAi methods are described by Fire, A. et al. (1998; Nature 391:806-811) and 
Gura, T. (2000; Nature 404:804-808). PTGS can also be initiated by introduction of a complementary 
segment of DNA into the selected tissue using gene delivery and/or viral vector delivery methods 
10 described herein or known in the art. 

RNAi can be induced m mammalian cells by the use of small mterfering RNA also known as 
siRNA. SiRNA are shorter segments of dsRNA (typically about 21 to 23 nucleotides in length) that 
result in vivo from cleavage of introduced dsRNA by the action of an endogenous ribonuclease. 
SiRNA appear to be the mediators of the RNAi effect m mammals. The most effective siRNAs 
15 appear to be 21 nucleotide dsRNAs with 2 nucleotide 3' overhangs. The use of siRNA for inducing 
RNAi in mammalian cells is described by Elbashir, S.M. et al. (2001; Nature 411:494-498). 

SiRNA can either be generated indirectly by introduction of dsRNA into the targeted cell, or 
directly by mammalian transfection methods and agents described herein or known in the art (such as 
liposome-mediated transfection, viral vector methods, or other polynucleotide deUvery/introductory 
methods). Suitable SiRNAs can be selected by examining a transcript of the target polynucleotide 
(e.g., mRNA) for nucleotide sequences downstream from the AUG start codon and recordmg the 
occurrence of each nucleotide and the 3' adjacent 19 to 23 nucleotides as potential siRNA target sites, 
with sequences having a 21 nucleotide length being preferred. Regions to be avoided for target 
siRNA sites include the 5' and 3' untranslated regions (UTRs) and regions near the start codon (within 
25 75 bases), as these may be richer in regulatory protein binding sites. UTR-bindmg protems and/or 
translation initiation conq)lexes may interfere with binding of the siRNP endonuclease complex. The 
selected target sites for siRNA can then be compared to the appropriate genome database (e.g., 
human, etc.) using BLAST or other sequence comparison algorithms known in the art. Target 
sequences with significant homology to other codmg sequences can be elimmated from consideration. 
30 The selected SiRNAs can be produced by chemical synthesis methods known m the art or by in vitro 
transcription using commercially available methods and kits such as the SILENCER siRNA 
construction kit (Ambion, Austm TX). 

In altemative embodiments, long-term gene silencmg and/or RNAi effects can be induced m 
selected tissue using expression vectors that continuously express siRNA. This can be accomplished 
35 using expression vectors that are engineered to express hairpin RNAs (shRNAs) using methods 
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known in the art (see, e.g.. Brummelkamp, T.R. et al. (2002) Science 296:550-553; and Paddison P J 
et al. (2002) Genes Dev. 16:948-958). In these and related embodiments, shRNAs can be deUverld to 
target cells using expression vectors known in the art. An example of a suitable expression vector for 
delivery of siRNA is the PSILENCER1.0-U6 (circular) plasmid (Ambion). Once delivered to the 
5 target tissue, shRNAs are processed in vi.o into sxRNA-like molecules capable of cairying out gene- 
specific silencing. 

In various embodiments, the expression levels of genes targeted by RNAi or PTGS methods 
can be determined by assays for mRNA and/or protein analysis. Expression levels of the mRNA of a 
targeted gene, can be determined by northern analysis methods using, for example the 
10 NORTHERNMAX-GLY kit (Ambion); by microarray methods; by PCR methods; by real time PGR 
methods; and by other RNA/polynucleotide assays hiown in the art or described herein. Expression 
levels of the pioteiu encoded by the targeted gene can be determined by Western analysis using 
standard techniques known in the art. 

An additional embodiment of the invention encompasses a method for screening for a 
15 compomid which is effective in altering expression of a polynucleotide encoding CGDD. 

Compounds which may be effective in altering expression of a specific polynucleotide may include, 
but are not lunited to. oligonucleotides, antisense oligonucleotides, triple heUx-forming 
oligonucleotides, transcription factors and other polypeptide transcriptional regulators, and non- 
macromolecular chemical entities which are capable of interacting with specific polynucleotide 
^0 sequences. Effective compounds may alter polynucleotide expression by acting as either inhibitors or 
promoters of polynucleotide expression. Thus, in die treatment of disorders associated with mcreased 
CGDD expression or activity, a compomid which specifically inhibits expression of the 
polynucleotide encoding CGDD may be therapeutically useful, and m the treatment of disorders 
associated with decreased CGDD expression or activity, a compound which specifically promotes 
5 expression of tiie polynucleotide encoding CGDD may be tiierapeutically usefid. 

m various embodiments, one or more test compounds may be screened for effectiveness in 
altering expression of a specific polynucleotide. A test compound may be obtained by any method 
commonly known m the art. mcluding chemical modification of a compound known to be effective in 
altermg polynucleotide expression; selection fi-om an existing, commercially-available or proprietary 
) hbrary of naturally-occurring or non-natural chemical compounds; rational design of a compound 
based on chemical and/or structural properties of tiie target polynucleotide; and selection from a 
hbrary of chemical compounds created combinatorially or randomly. A sample comprismg a 
polynucleotide encodmg CGDD is exposed to at least one test compound thus obtained. The sample 
may comprise, for example, an mtact or permeabilized cell, or an in vitro cell-free or reconstituted 
biochemical system. Alterations m die expression of a polynucleotide encoding CGDD are assayed 
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by any method commonly known in the art. Typically, the expression of a specific nucleotide is 
detected by hybridization with a probe having a nucleotide sequence complementary to the sequence 
of the polynucleotide encoding CGDD. The amount of hybridization may be quantified, thus 
forming the basis for a comparison of the expression of the polynucleotide both with and without 
5 exposure to one or more test compounds. Detection of a change in the expression of a polynucleotide 
exposed to a test compound indicates that the test compound is effective in altering the expression of 
the polynucleotide. A screen for a compound effective in altering expression of a specific 
polynucleotide can be carried out, for example, using a Schizosaccharomyces pombe gene expression 
system (Atkins, D. et al. (1999) U.S. Patent No. 5,932,435; Amdt, G.M. et al. (2000) Nucleic Acids 

10 Res. 28:E 15) or a human cell line such as HeLa cell (Clarke, M.L. et al. (2000) Biochem. Biophys. 
Res. Conamun. 268:8-13). A particular embodiment of the present invention involves screening a 
combinatorial hbrary of oUgonucleotides (such as deoxyribonucleotides, ribonucleotides, peptide 
nucleic acids, and modified oligonucleotides) for antisense activity against a specific polynucleotide 
sequence (Bruice, T.W. et al. (1997) U.S. Patent No. 5,686,242; Bruice, T.W. et al. (2000) U.S. 

15 Patent No. 6,022,691). 

Many methods for introducing vectors into cells or tissues are available and equally suitable 
for use in vivo, in vitro, and ex vivo. For ex vivo therapy, vectors may be mtroduced into stem cells 
taken from the patient and clonally propagated for autologous transplant back into that same patient. 
Delivery by transfection, by liposome injections, or by polycationic amino polymers may be achieved 

20 using methods which are well knovra in the art (Goldman, C.K. et al. (1997) Nat. Biotechnol. 15:462- 
466). 

Any of the therapeutic methods described above may be applied to any subject in need of 
such therapy, including, for example, mammals such as humans, dogs, cats, cows, horses, rabbits, and 
monkeys. 

25 An additional embodunent of the invention relates to the administration of a composition 

which generally comprises an active ingredient formulated with a pharmaceutically acceptable 
excipient. Excipients may include, for example, sugars, starches, celluloses, gums, and proteins. 
Various formulations are commonly known and are thoroughly discussed in the latest edition of 
Remington's Pharmaceuti cal Sciences (Maack Publishmg, Easton PA). Such compositions may 

30 consist of CGDD, antibodies to CGDD, and mimetics, agonists, antagonists, or inhibitors of CGDD. 
Li various embodunents, the compositions described herein, such as pharmaceutical 
conq)ositions, may be administered by any number of routes including, but not limited to, oral, 
intravenous, intramuscular, intra-arterial, intramedullary, intt^athecal, intiraventricular, pulmonary, 
transdermal, subcutaneous, mtraperitoneal, intranasal, enteral, topical, sublingual, or rectal means. 

J5 Compositions for pulmonary administration may be prepared in liquid or dry powder form. 
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These compositions are generally aerosolized immediately prior to inhalation by the patient. In the 
case of small molecules (e.g. traditional low molecular weight organic drugs), aerosol delivery of fast- 
acting formulations is well-known in the art. In the case of macromolecules (e.g. larger peptides and 
proteins), recent developments in the field of puhnonary delivery via the alveolar region of the lung 
5 have enabled the practical delivery of drugs such as insulin to blood circulation (see, e.g., Patton, J.S. 
et al., U.S. Patent No. 5,997,848). Pulmonary delivery allows administration without needle 
injection, and obviates the need for potentially toxic penetration enhancers. 

Compositions suitable for use in the invention include compositions wherein the active 
ingredients are contained in an effective amount to achieve the intended purpose. The determination 
10 of an effective dose is well within the capability of those skilled m the art. 

Specialized fonns of compositions may be prepared for direct intracellular delivery of 
macromolecules comprising CGDD or fi^gments thereof. For example, liposome preparations 
contaming a cell-impermeable macromolecule may promote cell fusion and intracellular delivery of 
the macromolecule. Alternatively, CGDD or a fragment thereof may be joined to a short cationic N- 
15 terminal portion from the HIV Tat-1 protein. Fusion proteins dius generated have been found to 

transduce into the ceUs of all tissues, including the brain, in a mouse model system (Schwarze, S.R. et 
al. (1999) Science 285:1569-1572). 

For any compound, the therapeutically effective dose can be estimated initially either in cell 
culture assays, e.g., of neoplastic cells, or in animal models such as mice, rats, rabbits, dogs, monkeys, 
20 or pigs. An animal model may also be used to determine the appropriate concentration range and 
route of administration. Such information can then be used to determme useful doses and routes for 
administration in humans. 

A therapeutically effective dose refers to that amount of active mgredient, for example CGDD 
or fragments thereof, antibodies of CGDD, and agonists, antagonists or inhibitors of CGDD, which 
25 ameliorates the symptoms or condition. Therapeutic efficacy and toxicity may be determined by 
standard pharmaceutical procedures in cell cultures or with experimental animals, such as by 
calculating the EDjo (the dose therapeutically effective in 50% of the population) or LD^ (the dose 
lethal to 50% of the population) statistics. The dose ratio of toxic to therapeutic effects is the 
tharapeutic index, which can be expressed as the U^so/^so ratio. Compositions which exhibit large 
30 therapeutic indices are preferred. The data obtained from cell culture assays and animal studies are 
used to formulate a range of dosage for human use. The dosage contained in such compositions is 
preferably withm a range of circulating concentrations that includes the ED50 with little or no toxicity. 
The dosage varies within this range depending upon the dosage form employed, the sensitivity of the 
patient, and the route of administration. 
35 The exact dosage will be determined by the practitioner, in light of factors related to the 
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subject requiring treatment. Dosage and administration are adjusted to provide sufficient levels of the 
active moiety or to maintain the desired effect. Factors which may be taken into account include the 
severity of the disease state, the general health of the subject, the age, weight, and gender of the 
subject, time and frequency of administration, drug combination(s), reaction sensitivities, and 
response to therapy. Long-acting compositions may be administered every 3 to 4 days, every week, or 
biweekly depending on the half-life and clearance rate of the particular formulation. 

Normal dosage amounts may vary from about 0. 1 pig to 100,000 ^g, up to a total dose of 
about 1 gram, depending upon the route of administration. Guidance as to particular dosages and 
methods of delivery is provided in the literature and generally available to practitioners m the art. 
Those skilled in the art will employ different formulations for nucleotides than for proteins or their 
inhibitors. Similarly, delivery of polynucleotides or polypeptides will be specific to particular cells, 
conditions, locations, etc. 
DIAGNOSTICS 

In another embodiment, antibodies which specifically bind CGDD may be used for the 
diagnosis of disorders characterized by expression of CGDD, or in assays to monitor patients being 
treated with CGDD or agonists, antagonists, or inhibitors of CGDD. Antibodies useful for diagnostic 
purposes may be prepared in the same manner as described above for therapeutics. Diagnostic assays 
for CGDD include methods which utilize the antibody and a label to detect CGDD m human body 
fluids or in extracts of cells or tissues. The antibodies may be used with or without modification, and 
may be labeled by covalent or non-covalent attachment of a reporter molecule. A wide variety of 
reporter molecules, several of which are described above, are known in the art and may be used. 

A variety of protocols for measuring CGDD, including ELISAs, RIAs, and FACS, are known 
in the art and provide a basis for diagnosmg altered or abnormal levels of CGDD expression. Normal 
or standard values for CGDD expression are established by combining body fluids or cell extracts 
taken from normal manamalian subjects, for example, human subjects, with antibodies to CGDD 
under conditions suitable for con5>lex formation. The amount of standard complex formation may be 
quantitated by various methods, such as photometric means. Quantities of CGDD expressed in 
subject, control, and disease samples firom biopsied tissues are compared with the standard values. 
Deviation between standard and subject values establishes the parameters for diagnosing disease. 

In another embodiment of the mvention, polynucleotides encodmg CGDD may be used for 
diagnostic purposes. The polynucleotides which may be used include oligonucleotides, 
complementary RNA and DNA molecules, and PNAs. The polynucleotides may be used to detect 
and quantify gene expression in biopsied tissues in which expression of CGDD may be correlated 
with disease. The diagnostic assay may be used to determine absence, presence, and excess 
expression of CGDD, and to monitor regulation of CGDD levels during therapeutic intervention. 
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In one aspect, hybridization with PGR probes which are capable of detecting polynucleotides, 
including genomic sequences, encoding CGDD or closely related molecules may be used to identify 
nucleic acid sequences which encode CGDD. The specificity of the probe, whether it is made from a 
highly specific region, e.g., the 5' regulatory region, or from a less specific region, e.g., a conserved 
motif, and the stringency of the hybridization or amplification will determine whether the probe 
identifies only naturally occurring sequences encoding CGDD, allelic variants, or related sequences. 

Probes may also be used for the detection of related sequences, and may have at least 50% 
sequence identity to any of the CGDD encoding sequences. The hybridization probes of the subject 
invention may be DNA or RNA and may be derived from the sequence of SEQ ID NO:34-66 or from 
genomic sequences including promoters, enhancers, and introns of the CGDD gene. 

Means for producing specific hybridization probes for polynucleotides encoding CGDD 
include the cloning of polynucleotides encoding CGDD or CGDD derivatives into vectors for the 
production of mRNA probes. Such vectors are known in the art, are commercially available, and may 
be used to syntiiesize RNA probes in vitro by means of the addition of die appropriate RNA 
polymerases and the appropriate labeled nucleotides. Hybridization probes may be labeled by a 
variety of reporter groups, for example, by radionuclides such as ^^P or ^'S, or by enzymatic labels, 
such as alkaline phosphatase coupled to the probe via avidin/biotin coupling systems, and the like. 

Polynucleotides encoding CGDD may be used for the diagnosis of disorders associated with 
expression of CGDD. Exaaples of such disorders include, but are not limited to, a cell proliferative 
disorder such as actinic keratosis, arteriosclerosis, atherosclerosis, bursitis, cirrhosis, hepatitis, mixed 
connective tissue disease (MCTD), myelofibrosis, paroxysmal noctiimal hemoglobinuria, 
polycythemia vera, psoriasis, primary thrombocythemia, and cancers including adenocarcinoma, 
leukemia, lyn^jhoma, melanoma, myeloma, sarcoma, teratocarcinoma, and, in particular, cancers of 
the adrenal gland, bladder, bone, bone marrow, brain, breast, cervix, colon, gall bladder, ganglia, 
gastrointestinal tract, heart, kidney, liver, lung, muscle, ovary, pancreas, parathyroid, penis, prostate, 
salivary glands, skin, spleen, testis, thymus, thyroid, and uterus; a developmental disorder such as 
renal tubular acidosis, anemia, Cushmg's syndrome, achondroplastic dwarfism, Duchenne and Becker 
muscular dystrophy, epilepsy, gonadal dysgenesis, WAGR syndrome (Wihns' tumor, aniridia, 
genitourinary abnormalities, and mental retardation), Smith-Magenis syndrome, myelodysplastic 
syndrome, hereditary mucoepithelial dysplasia, hereditary keratodermas, hereditary neuropathies such 
as Charcot-Marie-Tooth disease and neurofibromatosis, hypothyroidism, hydrocephalus, seizure 
disorders such as Syndenham's chorea and cerebral palsy, spma bifida, anencephaly, 
craniorachischisis, congenital glaucoma, cataract, and sensorineural hearing loss; a neurological 
disorder such as epilepsy, ischemic cerebrovascular disease, stroke, cerebral neoplasms, Alzheimer's 
disease. Pick's disease, Huntington's disease, dementia, Parkinson's disease and other extrapyramidal 
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disorders, amyotrophic lateral sclerosis and other motor neuron disorders, progressive neural 
muscular atrophy, retinitis pigmentosa, hereditary ataxias, multiple sclerosis and other demyelinating 
diseases, bacterial and viral memngitis, brain abscess, subdural empyema, epidural abscess, 
suppurative intracranial dirombophlebitis, myelitis and radiculitis, viral central nervous system 
5 disease, prion diseases including kuru, Creutzfeldt-Jakob disease, and Gerstmann- 

Straussler-Scheinker syndrome, fatal familial msomnia, nutritional and metabolic diseases of the 
nervous system, neurofibromatosis, tuberous sclerosis, cerebelloretinal hemangioblastomatosis. 
encephalotrigeminal syndrome, mental retardation and other developmental disorders of the central 
nervous system including Down syndrome, cerebral palsy, neuroskeletal disorders, autonomic 
10 nervous system disorders, cranial nerve disorders, spinal cord diseases, muscular dystrophy and other 
neuromuscular disorders, peripheral nervous system disorders, dennatomyositis and polymyositis, 
inherited. metaboUc, endocrine, and toxic myopathies, myasthenia gravis, periodic paralysis, mental 
disorders including mood, anxiety, and schizophrenic disorders, seasonal affective disorder (SAD), 
akathesia, amnesia, catatonia, diabetic neuropathy, tardive dyskmesia, dystonias, paranoid psychoses, 
15 postherpetic neuralgia, Tourette's disorder, progressive supranuclear palsy, corticobasal degeneration, 
and familial frontotemporal dementia; an autoimmune/mflammatory disorder such as acquired 
immunodeficiency syndrome (AIDS), Addison's disease, adult respiratory distress syndrome, 
aUergies, ankylosing spondylitis, amyloidosis, anemia, asthma, atherosclerosis, autoimmune 
hemolytic anemia, autoimmune thyroiditis, autoimmune polyendocrinopathy-candidiasis-ectodermal 
20 dystrophy (APECED), bronchitis, cholecystitis, contact dermatitis, Crohn's disease, atopic dermatitis, 
dermatomyositis. diabetes meUitus. emphysema, episodic lymphopenia with lymphocytotoxms, 
erythroblastosis fetalis, erythema nodosum, atrophic gastritis, glomerulonephritis, Goodpasture's 
syndrome, gout. Graves' disease, Hashunoto's thyroiditis, hypereosmophilia, hritable bowel 
syndrome, multiple sclerosis, myasthenia gravis, myocardial or pericardial mflammation, 
25 osteoarthritis, osteoporosis, pancreatitis, polymyositis, psoriasis. Reiter's syndrome, rheumatoid 
arthritis, scleroderma, Sjogren's syndrome, systemic anaphylaxis, systemic lupus erythematosus, 
systemic sclerosis, thrombocytopenic purpura, ulcerative colitis, uveitis. Werner syndrome, 
complications of cancer, hemodialysis, and extracorporeal circulation, viral, bacterial, fimgal, 
parasitic, protozoal, and helmmtiiic infections, and tiramna; a reproductive disorder such as a disorder 
10 of prolactin production, infertility, mcludmg tubal disease, ovulatory defects, endometriosis, a 
disruption of the estrous cycle, a disruption of the menstrual cycle, polycystic ovaiy syndrome, 
ovarian hyperstimulation syndrome, an endometrial or ovarian tmnor. a uterine fibroid, autounmune 
disorders, ectopic pregnancy, teratogenesis; cancer of tiie breast, fibrocystic breast disease, 
galactorrhea; a disraption of spermatogenesis, abnormal spenn physiology, cancer of the testis, cancer 
5 of the prostate, benign prostatic hyperplasia, prostatitis, Peyronie's disease, mqjotence, carcinoma of 
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the male breast, gynecomastia, hypergonadotropic and hypogonadotropic hypogonadism, 
pseudohermaphroditism, azoospermia, premature ovarian failure, acrosm deficiency, delayed puperty, 
retrograde ejaculation and anejaculation, haemangioblastomas, cystsphaeochromocytomas, 
paraganglioma, cystadenomas of the epididymis, and endolymphatic sac tumors; and a disorder of the 
placenta such as preeclampsia, choriocarcinoma, abruptio placentae, placenta previa, placental or 
maternal floor infarction, placenta accreta, increate, and percreta, extrachorial placentas, 
chorangioma, chorangiosis, chronic villitis, placental villous endema, widespread fibrosis of the 
terminal villi, intervillous thrombi, hemorraghic endovasculitis, erythroblastosis fetalis, and 
nonimmune fetal hydrops. Polynucleotides encoding CGDD may be used in Southern or northern 
analysis, dot blot, or other membrane-based technologies; in PGR technologies; in dipstick, pin, and 
multiformat ELISA-Iike assays; and m microarrays utilizing fluids or tissues from patients to detect 
altered CGDD expression. Such qualitative or quantitative methods are well known in the art. 

In a particular embodunent, polynucleotides encoding CGDD may be used in assays that 
detect the presence of associated disorders, particularly those mentioned above. Polynucleotides 
complementary to sequences encoding CGDD may be labeled by standard methods and added to a 
fluid or tissue sample from a patient under conditions suitable for the formation of hybridization 
complexes. After a suitable incubation period, the sample is washed and the signal is quantified and 
compared with a standard value. If the amount of signal in the patient sample is significantly altered 
m comparison to a confrol sample then the presence of altered levels of polynucleotides encoding 
CGDD in the sample indicates the presence of the associated disorder. Such assays may also be used 
to evaluate the efficacy of a particular therapeutic treatment regimen in animal studies, in clmical 
trials, or to monitor the treatment of an individual patient. 

In order to provide a basis for the diagnosis of a disorder associated with expression of 
CGDD, a normal or standard profile for expression is established. This may be accompUshed by 
combming body fluids or cell extracts taken from normal subjects, either animal or human, with a 
sequence, or a fragment thereof, encoding CGDD, under conditions suitable for hybridization or 
amplification. Standard hybridization may be quantified by comparmg the values obtained from 
normal subjects with values from an experiment in which a known amount of a substantially purified 
polynucleotide is used. Standard values obtamed in this manner may be compared with values 
obtained from sanq)les from patients who are symptomatic for a disorder. Deviation from standard 
values is used to establish the presence of a disorder. 

Once the presence of a disorder is established and a treatment protocol is initiated, 
hybridization assays may be repeated on a regular basis to determine if the level of expression m the 
patient begins to approximate that which is observed in the normal subject. The results obtained from 
successive assays may be used to show the efficacy of treatment over a period ranging from several 
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days to months. 

With respect to cancer, the presence of an abnormal amount of transcript (either under- or 
overexpressed) in biopsied tissue from an individual may indicate a predisposition for the 
development of the disease, or may provide a means for detecting the disease prior to the appearance 
of actual clinical symptoms. A more definitive diagnosis of this type may aOow health professionals 
to employ preventative measures or aggressive treatment earlier, thereby preventing the development 
or further progression of the cancer. 

Additional diagnostic uses for oligonucleotides designed from the sequences encoding CGDD 
may involve the use of PGR. These oligomers may be chemically synthesized, generated 
enzymatically, or produced in vitro. Oligomers will preferably contam a fragment of a polynucleotide 
encoding CGDD, or a fragment of a polynucleotide complementary to the polynucleotide encoding 
CGDD, and will be employed under optimized conditions for identification of a specific gene or 
condition. Oligomers may also be employed under less stringent conditions for detection or 
, quantification of closely related DNA or RNA sequences. 

In a particular aspect, oligonucleotide primers derived from polynucleotides encoding CGDD 
may be used to detect single nucleotide polymorphisms (SNPs). SNPs are substitutions, insertions 
and deletions that are a frequent cause of inherited or acquired genetic disease in humans. Methods 
of SNP detection include, but are not limited to, single-stranded conformation polymorphism (SSCP) 
and fluorescent SSCP (fSSCP) methods. In SSCP, oligonucleotide primers derived from 
polynucleotides encoding CGDD are used to amplify DNA usmg die polymerase chain reaction 
(PGR). The DNA may be derived, for example, from diseased or normal tissue, biopsy samples, 
bodily fluids, and the like. SNPs m the DNA cause differences in the secondary and tertiary 
structures of PGR products in single-stranded form, and these differences are detectable using gel 
electrophoresis in non-denaturing gels. In fSCCP, the oUgonucleotide primers are fluorescently 
labeled, which allows detection of the anqjlimers in high-throughput equipment such as DNA 
sequencing machines. Additionally, sequence database analysis mediods, termed in silico SNP 
(isSNP), are capable of identiifying polymorphisms by comparing die sequence of mdividual 
overlapping DNA fragments which assemble into a common consensus sequence. These computer- 
based methods fUter out sequence variations due to laboratory preparation of DNA and sequencing 
errors using statistical models and automated analyses of DNA sequence chromatograms. In the 
alternative, SNPs may be detected and characterized by mass spectrometry using, for example, the 
high throughput MASSARRAY system (Sequenom, Inc., San Diego CA). 

SNPs may be used to study the genetic basis of human disease. For example, at least 16 
common SNPs have been associated with non-insulin-dependent diabetes mellitus. SNPs are also 
usefiil for examining differences m disease outcomes m monogenic disorders, such as cystic fibrosis. 
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Sickle cell anemia, or chronic granulomatous disease. For example, variants in the mannose-binding 
lectin, MBL2. have been shown to be correlated with deleterious pulmonary outcomes in cystic 
fibrosis. SNPs also have utility in pharmacogenomics, the identification of genetic variants that 
influence a patient's response to a drug, such as life-threatening toxicity. For example, a variation in 
5 N-acetyl transferase is associated with a high incidence of peripheral neuropathy in response to the 
anti-tuberculosis drug isoniazid, while a variation in the core promoter of the AL0X5 gene results in 
diminished clinical response to treatment with an anti-asthma drug that targets the 5-lipoxygenase 
pathway. Analysis of the distribution of SNPs in different populations is useful for investigating 
genetic drift, mutation, recombination, and selection, as well as for tracmg the origins of populations 
10 and their migrations (Taylor, J.G. et al. (2001) Trends Mol. Med. 7:507-512; Kwok, F.-Y. and Z. Gu 
(1999) Mol. Med. Today 5:538-543; Nowotny, P. et al. (2001) Curr. Opin. Neurobiol. 11:637-641). 

Methods which may also be used to quantify the expression of CGDD include radiolabeling 
or biotinyladng nucleotides, coamplification of a control nucleic acid, and interpolating resuhs from 
standard curves (Melby. P.C. et al. (1993) J. Immunol. Mediods 159:235-244; Duplaa, C. et al. (1993) 
15 Anal. Biochem. 212:229-236). The speed of quantitation of multiple samples may be accelerated by 
running the assay in a high-throughput format where the oligomer or polynucleotide of interest is 
presented in various dilutions and a spectrophotometric or colorimetric response gives rapid 
quantitation. 

In fiirther embodiments, oligonucleotides or longer fragments derived from any of the 
20 polynucleotides described herem may be used as elements on a microairay. The microarray can be 
used in transcript imaging techniques which monitor the relative expression levels of large numbers 
of genes simultaneously as described below. The microarray may also be used to identify genetic 
variants, mutations, and , polymorphisms. This mformation may be used to determme gene ftmction 
to miderstand the genetic basis of a disorder, to diagnose a disorder, to monitor progression/regression 
25 of disease as a function of gene expression, and to develop and monitor the activities of dierapeutic 
agents m the treatment of disease, hi particular, this mformation may be used to develop a 
pharmacogenomic profile of a patient in order to select the most appropriate and effective treatment 
regimen for that patient. For exanq,le. therapeutic agents which are highly effective and display the 
fewest side effects may be selected for a patient based on his/her pharmacogenomic profile. 
30 In another embodhnent. CGDD. fragments of CGDD, or antibodies specific for CGDD may 

be used as elements on a microarray. The microarray may be used to monitor or measure protein- 
protein interactions, drug-target interactions, and gene expression profiles, as described above. 

A particular embodiment relates to the use of the polynucleotides of the present invention to 
generate a transcript image of a tissue or cell type. A transcript image represents the global pattern of 
15 geneexpressionbyaparticulartissueorcelltype. Global gene expression patterns are analyzed by 
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quantifying the number of expressed genes and then- relative abundance under given conditions and at 
a given time (Seilhamer et al., "Comparative Gene Transcript Analysis," U.S. Patent No. 5,840,484; 
hereby expressly incorporated by reference herein). Thus a transcript image may be generated by 
hybridizing the polynucleotides of the present invention or then- complements to the totality of 
5 transcripts or reverse transcripts of a particular tissue or cell type. In one embodunent, the 
hybridization takes place in high-throughput format, wherein the polynucleotides of the present 
invention or their complements comprise a subset of a plurality of elements on a microarray. The 
resultant transcript image would provide a profile of gene activity. 

Transcript images may be generated using transcripts isolated from tissues, cell lines, 
10 biopsies, or other biological samples. The transcript image may thus reflect gene expression in vivo, 
as in the case of a tissue or biopsy sample, or in vitro, as in the case of a cell line. 

Transcript images which profile the expression of the polynucleotides of the present invention 
may also be used in conjunction with in vitro model systems and preclmical evaluation of 
pharmaceuticals, as well as toxicological testing of industrial and naturally-occurring envuronmental 
15 compounds. AU compounds induce characteristic gene expression patterns, frequently termed 
molecular fingerprints or toxicant signatures, which are indicative of mechanisms of action and 
toxicity (Nuwaysir, E.F. et al. (1999) Mol. Carcinog. 24:153-159; Steiner, S. and N.L. Anderson 
(2000) Toxicol. Lett. 1 12-1 13:467-471). If a test compound has a signature similar to that of a 
conqxjund with known toxicity, it is likely to share those toxic properties. These fingerprints or 
20 signatures are most usefiil and refined when they contain expression information from a large number 
of genes and gene famiUes. Ideally, a genome-wide measurement of expression provides the highest 
quality signature. Even genes whose expression is not altered by any tested compounds are important 
as well, as the levels of expression of these genes are used to normalize the rest of the expression 
data. The normalization procedure is usefiil for comparison of expression data after treatment with 
25 different compounds. While the assignment of gene fimction to elements of a toxicant signature aids 
in mterpretation of toxicity mechanisms, knowledge of gene fimction is not necessary for the 
statistical matching of signatures which leads to prediction of toxicity (see, for example. Press 
Release 00-02 from the National Institute of Environmental Health Sciences, released February 29, 
2000, available at http://www.niehs.nih.gov/oc/news/toxchip.htm). Therefore, it is important and 
30 desirable in toxicological screening using toxicant signatures to include aU expressed gene sequences. 
la an embodunent, the toxicity of a test compound can be assessed by treating a biological 
sample containing nucleic acids with the test compound. Nucleic acids that are expressed in the 
treated biological sample are hybridized with one or more probes specific to die polynucleotides of 
the present mvention, so that transcript levels conresponding to the polynucleotides of the present 
J5 invention may be quantified. The transcript levels m the treated biological sample are compared with 
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levels in an untreated biological sample. Differences in the transcript levels between the two samples 
are indicative of a toxic response caused by the test compound in the treated sample. 

Another embodiment relates to the use of the polypeptides disclosed herein to analyze the 
proteome of a tissue or cell type. The term proteome refers to the global pattern of protein expression 
5 in a particular tissue or cell type. Each protein component of a proteome can be subjected 
individually to further analysis. Proteome expression pattems, or profiles, are analyzed by 
quantifying the number of expressed proteins and their relative abundance under given conditions and 
at a given time. A profile of a cell's proteome may thus be generated by separating and analyzing the 
polypeptides of a particular tissue or cell type. In one embodiment, the separation is achieved using 

10 two-dimensional gel electrophoresis, in which proteins from a sample are separated by isoelectric 
focusing in the first dimension, and then according to molecular weight by sodium dodecyl sulfate 
slab gel electrophoresis in the second dimension (Steiner and Anderson, supra). The proteins are 
visualized in the gel as discrete and uniquely positioned spots, typically by staining the gel with an 
agent such as Coomassie Blue or silver or fluorescent stains. The optical density of each protein spot 

15 is generally proportional to the level of the protein in the sample. The optical densities of 

equivalently positioned protein spots from different samples, for example, from biological samples 
either treated or untreated with a test compound or therapeutic agent, are compared to identify any 
changes in protein spot density related to the treatment. The proteins in the spots are partially 
sequenced using, for example, standard methods employing chemical or enzymatic cleavage followed 

20 by mass spectrometry. The identity of the protein in a spot may be determined by comparing its 

partial sequence, preferably of at least 5 contiguous amino acid residues, to the polypeptide sequences 
of interest. In some cases, further sequence data may be obtamed for definitive protein identification. 

A proteomic profile may also be generated using antibodies specific for CGDD to quantify 
the levels of CGDD expression, hi one embodiment, the antibodies are used as elements on a 

25 microarray, and protein expression levels are quantified by exposing the irricroarray to the sample and 
detecting the levels of protein bound to each array element (Lueldng, A. et al. (1999) Anal. Biochem 
270:103-111; Mendoze, L.G. et al. (1999) Biotechniques 27:778-788). Detection may be performed 
by a variety of methods known m the art, for example, by reacting the proteins in the sample with a 
thiol- or amino-reactive fluorescent compound and detecting the amount of fluorescence bound at 

30 each array element. 

Toxicant signatures at the proteome level are also useful for toxicoiogical screening, and 
should be analyzed in parallel with toxicant signatures at the transcript level. There is a poor 
correlation between transcript and protein abundances for some proteins in some tissues (Anderson, 
N.L. and J. Seilhamer (1997) Electrophoresis 18:533-537), so proteome toxicant signatures may be 

35 useful m the analysis of compounds which do not significantly affect the transcript image, but which 
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alter the proteomic profile. In addition, the analysis of transcripts in body fluids is difficult, due to ■ 
rapid degradation of mRNA, so proteomic profiling may be more reliable and informative in such 
cases. 

In another embodiment, the toxicity of a test compound is assessed by treating a biological 
5 sample containing proteins with the test compound. Proteins that are expressed in the treated 

biological sample are separated so that the amount of each protein can be quantified. The amount of 
each protein is compared to the amount of the corresponding protein in an untreated biological 
sample. A difference in the amount of protein between the two samples is indicative of a toxic 
response to the test compound in the treated sample. Individual proteins are identified by sequencing 

10 the amino acid residues of the individual proteins and comparing these partial sequences to the 
polypeptides of the present invention. 

In another embodunent, the toxicity of a test compound is assessed by treating a biological 
sample containing protems with the test compound. Proteins from the biological sample are 
incubated with antibodies specific to the polypeptides of the present invention. The amount of 

15 protein recognized by the antibodies is quantified. The amount of protein in the treated biological 
sample is compared with the amount in an untreated biological sample. A difference m the amount of 
protein between the two samples is indicative of a toxic response to the test compound in the treated 
sample. 

Microarrays may be prepared, used, and analyzed using methods known in the art (Brennan, 
20 T.M. et al. (1995) U.S. Patent No. 5.474,796; Schena, M. et al. (1996) Proc. Natl. Acad. Sci. USA 
93:10614-10619; Baldeschweiler et al. (1995) PCT application W095/25 1116; Shalon, D. et al. 
(1995) PCT application WO95/35505; Heller, R.A. et al. (1997) Proc. Natl. Acad. Sci. USA 94:2150- 
2155; HeUer, M.J. et al. (1997) U.S. Patent No. 5,605,662). Various types of microarrays are well 
known and thoroughly described in Schena, M., ed. (1999; DNA Microarrays: A Practical Ap prn«rh 
25 Oxford University Press, London). 

In another embodiment of the mvention, nucleic acid sequences encoding CGDD may be used 
to generate hybridization probes useful in mapping the naturally occuiring genomic sequence. Either 
coding or noncoding sequences may be used, and in some instances, noncoding sequences may be 
preferable over coding sequences. For example, conservation of a coding sequence among members 
30 of a multi-gene family may potentiaUy cause undesired cross hybridization during chromosomal 
mapping. The sequences may be mapped to a particular chromosome, to a specific region of a 
chromosome, or to artificial chromosome constructions, e.g., human artificial chromosomes (HACs), 
yeast artificial chromosomes (YACs), bacterial artificial chromosomes (BACs), bacterial PI 
constructions, or single chromosome cDNA libraries (Harrington. J.J. et al. (1997) Nat. Genet. 
15:345-355; Price, CM. (1993) Blood Rev. 7:127-134; Trask, B.J. (1991) Trends Genet. 7:149-154). 
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Once mapped, the nucleic acid sequences may be used to develop genetic linkage maps, for example, 
which correlate the inheritance of a disease state with the mheritance of a particular chromosome 
region or restriction fragment length polymorphism (RFLP) (Lander, E.S. and D. Botstein (1986) 
Proc. Natl. Acad. Sci. USA 83:7353-7357). 

Fluorescent in situ hybridization (FISH) may be correlated with other physical and genetic 
map data (Heinz-Ulrich, et al. (1995) in Meyers, supra, pp. 965-968). Examples of genetic map data 
can be found in various scientific journals or at the Online Mendelian Inheritance in Man (OMIM) 
World Wide Web site. Correlation between the location of the gene encoding CGDD on a physical 
map and a specific disorder, or a predisposition to a specific disorder, may help define the region of 
DNA associated with that disorder and thus may further positional cloning efforts. 

In situ hybridization of chromosomal preparations and physical mapping techniques, such as 
linkage analysis using established chromosomal markers, may be used for extending genetic maps. 
Often the placement of a gene on the chromosome of another mammalian species, such as mouse, 
may reveal associated markers even if the exact chromosomal locus is not known. This information is 
valuable to investigators searching for disease genes using positional cloning or other gene discovery 
techniques. Once the gene or genes responsible for a disease or syndrome have been crudely 
localized by genetic linkage to a particular genomic region, e.g., ataxia-telangiectasia to 1 lq22-23, 
any sequences mappmg to that area may represent associated or regulatory genes for further 
investigation (Gatti, R.A. et al. (1988) Nature 336:577-580). The nucleotide sequence of the instant 
'0 invention may also be used to detect differences in the chromosomal location due to translocation, 
inversion, etc., among normal, carrier, or affected individuals. 

In another embodiment of the invention, CGDD, its catalytic or immunogenic fragments, or 
oligopeptides thereof can be used for screening libraries of compounds in any of a variety of drug 
screening techniques. The fragment employed in such screening may be free in solution, affixed to a 
solid support, borne on a cell surface, or located intracellularly. The formation of binding complexes 
between CGDD and the agent being tested may be measured. 

Another technique for drug screening provides for high throughput screening of compounds 
having suitable binding affinity to the protein of interest (Geysen, et al. (1984) PCT apphcation 
WO84/03564). In this method, large numbers of different small test compounds are synthesized on a 
I solid substrate. The test compounds are reacted with CGDD, or fragments thereof, and washed. 
Bound CGDD is then detected by methods well known in the art. Purified CGDD can also be coated 
durectiy onto plates for use in the aforementioned drug screening techniques. Alternatively, 
non-neutralizing antibodies can be used to capture the peptide and immobilize it on a soUd support. 

In another embodiment, one may use competitive drug screening assays m which neutralizing 
antibodies capable of bindmg CGDD specifically compete with a test compound for binding CGDD. 
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In this manner, antibodies can be used to detect the presence of any peptide which shares one or more 

antigenic determinants with CGDD. 

hi additional embodiments, the nucleotide sequences which encode CGDD may be used in 

any molecular biology techniques that have yet to be developed, provided the new techniques rely on 
5 properties of nucleotide sequences that are currently known, including, but not limited to, such 

properties as the triplet genetic code and specific base pair interactions. 

Without further elaboration, it is believed that one skiUed m the art can, using the preceding 

description, utilize the present invention to its fullest extent. The foUowing embodmients are, 

therefore, to be construed as merely iUustrative. and not limitative of the remainder of the disclosure 
10 in any way whatsoever. 

The disclosures of all patents, applications, and pubUcations mentioned above and below, 
including U.S. Ser. No. 60/340,747, U.S. Ser. No. 60/342,761, U.S. Ser. No. 60/349,705, U.S. Ser. 
No. 60/354,764, and U.S. Ser. No. 60/356,216, are hereby expressly mcorporated by reference. 

^ EXAMPLES 
I. Construction of cDNA Libraries 

facyte cDNAs were derived from cDNA libraries described m the LIFESEQ GOLD database 
(Incyte Genomics, Palo Alto CA). Some tissues were homogenized and lysed in guanidinium 
isothiocyanate, while others were homogenized and lysed in phenol or in a suitable mixture of 
denaturants, such as TRIZOL (hivitrogen), a monophasic solution of phenol and guanidine 
isothiocyanate. The resulting lysates were centrifuged over CsCl cushions or extracted with 
chloroform. RNA was precipitated from the lysates with either isopropanol or sodium acetate and 
ethanol, or by other routine methods. 

Phenol extraction and precipitation of RNA were repeated as necessary to increase RNA 
purity. In some cases. RNA was treated with DNase. For most libraries, poly(A)+ RNA was isolated 
using oligo d(T)-coupled paramagnetic particles (Promega), OLIGOTCX latex particles (QIAGEN, 
Chatsworth CA). or an OUGOTEX mRNA purification kit (QIAGEN). Altematively, RNA was 
isolated directly from tissue lysates using other RNA isolation kits, e.g., the POLY(A)PURE mRNA 
purification kit (Ambion, Austm TX). 

In some cases. Stratagene was provided with RNA and constructed the corresponding cDNA 
libraries. Otherwise, cDNA was synthesized and cDNA libraries were constructed with the UNIZAP 
vector system (Sfratagene) or SUPERSCRIPT plasmid system (Invitrogen), using the recommended 
procedm-es or similar methods known in the art (Ausubel et al, supra, ch. 5). Reverse transcription 
was initiated using oligo d{T) or random primers. Synthetic oligonucleotide adapters were Ugated to 
double sfranded cDNA. and the cDNA was digested with the appropriate restriction enzyme or 
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enzymes. For most libraries, the cDNA was size-selected (300-1000 bp) using SEPHACRYL SIOOO, 
SEPHAROSE CL2B, or SEPHAROSE CJAB colunm chromatography (Amersham Biosciences) or 
preparative agarose gel electrophoresis. cDNAs were ligated into compatible restriction enzyme sites 
of the polylinker of a suitable plasmid, e.g., PBLUESCRIPT plasmid (Stratagene), PSPORTl plasmid 
5 (Invitrogen, Carlsbad CA), PCDNA2. 1 plasmid (Invitrogen), PBK-CMV plasmid (Stratagene), PC312- 
TOPOTA plasmid (Invitrogen), PCMV-ICIS plasmid (Stratagene), pIGEN (Iticyte Genomics, Palo 
Alto CA), pRARE (hicyte Genomics), or pINCY (Incyte Genomics), or derivatives thereof. 
Recombinant plasmids were transformed into competent £ coli cells including XLl-Blue, XLl- 
BlueMRF, or SOLR from Stratagene or DH5a, DHIOB, or ElectroMAX DHIOB from Invitrogen. 

10 11. Isolationof cDNA Qones 

Plasmids obtained as described in Example I were recovered from host cells by in vivo 
excision using the UNIZAP vector system (Stratagene) or by cell lysis. Plasmids were purified using 
at least one of the following: a Magic or WIZARD Minipreps DNA purification system (Promega); an 
AGTC Miniprep purification kit (Edge Biosystems, Gaitiiersburg MD); and QIAWELL 8 Plasmid, 

15 QIAWELL 8 Plus Plasmid, QIAWELL 8 Ultra Plasmid purification systems or the R.E.A.L. PREP 96 
plasmid purification kit from QIAGEN. Following precipitation, plasmids were resuspended in 0. 1 
ml of distilled water and stored, with or without lyophilization, at 4°C. 

Alternatively, plasmid DNA was amplified from host cell lysates using direct link PGR in a 
high-throughput format (Rao, V.B. (1994) Anal. Biochem. 216:1-14). Host cell lysis and diermal 

20 cycling steps were carried out in a single reaction mixture. Samples were processed and stored in 
384-well plates, and the concenfration of amplified plasmid DNA was quantified fluorometrically 
using PICOGREEN dye (Molecular Probes, Eugene OR) and a FLUOROSKAN II fluorescence 
scanner (Labsystems Oy, Helsinki, Finland), 
in. Sequencing and Analysis 

25 Incyte cDNA recovered in plasmids as described m Exai^le n were sequenced as follows. 

Sequencing reactions were processed using standard methods or high-throughput mstrumentation 
such as the ABI CATALYST 800 (AppUed Biosystems) thermal cycler or the PTC-200 thermal 
cycler (MJ Research) in conjunction with the HYDRA microdispenser (Robbins Scientific) or the 
MICROLAB 2200 (Hamilton) liquid transfer system. cDNA sequencing reactions were prepared 

30 using reagents provided by Amersham Biosciences or supplied in ABI sequencing kits such as the 
ABI PRISM BIGDYE Terminator cycle sequencing ready reaction kit (Applied Biosystems). 
Electrophoretic separation of cDNA sequencing reactions and detection of labeled polynucleotides 
were carried out using the MEGABACE 1000 DNA sequencing system (Amersham Biosciences); the 
ABI PRISM 373 or 377 sequencing system (Applied Biosystems) in conjunction with standard ABI 

35 protocols and base calling software; or other sequence analysis systems known in the art. Reading 
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frames within the cDNA sequences were identified using standard methods (Ausubel et al., supra, ch. 
7). Some of the cDNA sequences were selected for extension using the techniques disclosed in 
Exan^le VIH. 

The polynucleotide sequences derived from Incyte cDNAs were validated by removing 
5 vector, linker, and poly(A) sequences and by masking ambiguous bases, using algorithms and 

programs based on BLAST, dynamic programming, and dinucleotide nearest neighbor analysis. The 
Incyte cDNA sequences or translations thereof were then queried against a selection of public 
databases such as the GenBank primate, rodent, mammalian, vertebrate, and eukaryote databases, and 
BLOCKS, PRINTS, DOMO, PRODOM; PROTEOME databases with sequences from Homo sapiens, 

10 Rattus norvegicus. Mm miisculus, Caenorhabditis elegans, Saccharomyces cerevisiae, 

Schizosaccliaromyces ponibe, and Candida albicans (Incyte Genomics, Palo Alto CA); hidden 
Markov model (HMM)-based protein family databases such as PFAM, INCY, and TIGRFAM (Haft, 
D.H. et al. (2001) Nucleic Acids Res. 29:41-43); and HMM-based protein domain databases such as 
SMART (Schuhz, J. et al. (1998) Proc. Natl. Acad. Sci. USA 95:5857-5864; Letunic, L et al. (2002) 

15 Nucleic Acids Res. 30:242-244). (HMM is a probabilistic approach which analyzes consensus 
primary structures of gene families; see, for example, Eddy, S.R. (1996) Curr. Opin. Struct. Biol. 
6:361-365.) The queries were performed usmg programs based on BLAST, FASTA, BLIMPS, and 
HMMER. The Incyte cDNA sequences were assembled to produce full length polynucleotide 
sequences. Alternatively, GenBank cDNAs, GenBank ESTs, stitched sequences, stretched sequences, 

20 or Genscan-predicted coding sequences (see Examples FV and V) were used to extend Incyte cDNA 
assemblages to full length. Assembly was performed using programs based on Phred, Phrap, and 
Consed, and cDNA assemblages were screened for open reading frames using programs based on 
GeneMark, BLAST, and FASTA. The full length polynucleotide sequences were translated to derive 
the corresponding full length polypeptide sequences. Altematively, a polypeptide may begin at any of 

25 the methionine residues of the full length translated polypeptide. Full length polypeptide sequences 
were subsequently analyzed by querymg agamst databases such as the GenBank protein databases 
(genpept), SwissProt, the PROTEOME databases, BLOCKS, PRINTS, DOMO, PRODOM, Prosite, 
hidden Markov model (HMM)-based protein family databases such as PFAM, INCY, and TIGRFAM; 
and HMM-based protein domain databases such as SMART. Full length polynucleotide sequences 

30 are also analyzed using MACDNASIS PRO software (MiraiBio, Alameda CA) and LASERGENE 
software (DNASTAR). Polynucleotide and polypeptide sequence aligmnents are generated using 
default parameters specified by the CLUSTAL algorithm as incorporated mto the MEGALIGN 
multisequence alignment program (DNASTAR), which also calculates the percent identity between 
aligned sequences. 

35 Table 7 summarizes the tools, programs, and algorithms used for the analysis and assembly of 
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Incyte cDNA and full length sequences and provides applicable descriptions, references, and 
threshold parameters. The first column of Table 7 shows the tools, programs, and algorithms used, 
the second column provides brief descriptions thereof, the third column presents appropriate 
references, all of which are incorporated by reference herein in their entirety, and the fourth column 
5 presents, where applicable, the scores, probability values, and other parameters used to evaluate the 
strength of a match between two sequences (the higher the score or the lower the probability value, 
the greater the identity between two sequences). 

The programs described above for the assembly and analysis of full length polynucleotide and 
polypeptide sequences were also used to identify polynucleotide sequence fragments from SEQ ID 
10 NO: 34-66. Fragments from about 20 to about 4000 nucleotides which are useful in hybridization and 
amplification technologies are described in Table 4, column 2. 
IV. Identification and Editing of Coding Sequences from Genomic DNA 

Putative proteins associated with cell growth, differentiation, and death were initially 
identified by running the Genscan gene identification program agamst public genomic sequence 
15 databases (e.g., gbpri and gbhtg). Genscan is a general-purpose gene identification program which 
analyzes genomic DNA sequences from a variety of organisms (Burge, C. and S. Karlin (1997) J. 
Mol. Biol. 268:78-94; Burge, C. and S. Karlin (1998) Curr. Opin. Struct. Biol. 8:346-354). The 
program concatenates predicted exons to form an assembled cDNA sequence extending fi-om a 
methionine to a stop codon. The output of Genscan is a FASTA database of polynucleotide and 

20 polypeptide sequences. The maximum range of sequence for Genscan to analyze at once was set to 
30 kb. To determine which of these Genscan predicted cDNA sequences encode proteins associated 
with ceU growth, differentiation, and death, the encoded polypeptides were analyzed by querymg 
against PFAM models for proteins associated with ceU growdi, differentiation, and death. Potential 
proteins associated with cell growth, differentiation, and death were also identified by homology to 

25 iQcyte cDNA sequences that had been annotated as proteins associated with cell growth, 

differentiation, and death. These selected Genscan-predicted sequences were then compared by 
BLAST analysis to the genpept and gbpri public databases. Where necessary, the Genscan-predicted 
sequences were then edited by conqjarison to the top BLAST hit from genpept to correct errors in the 
sequence predicted by Genscan, such as extra or omitted exons. BLAST analysis was also used to 

30 find any lacyte cDNA or public cDNA coverage of the Genscan-predicted sequences, thus providmg 
evidence for transcription. When Incyte cDNA coverage was available, this information was used to 
correct or confirm the Genscan predicted sequence. Full length polynucleotide sequences were 
obtained by assembling Genscan-predicted coding sequences with Incyte cDNA sequences and/or 
public cDNA sequences using the assembly process described in Exanqjle m. Alternatively, fuU 

35 length polynucleotide sequences were derived entirely fix)m edited or unedited Genscan-predicted 
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coding sequences. 

V. Assembly of Genomic Sequence Data with cDNA Sequence Data 
"Stitched" Seq uenri-s 

Partial cDNA sequences were extended with exons predicted by the Genscan gene 
5 identification program described in Example IV. Partial cDNAs assembled as described in Example 
ffl were mapped to genomic DNA and parsed into clusters containing related cDNAs and Genscan 
exon predictions from one or more genomic sequences. Each cluster was analyzed using an algorithm 
based on graph theory and dynamic programming to integrate cDNA and genomic information, 
generating possible splice variants that were subsequently confirmed, edited, or extended to create a 
10 full length sequence. Sequence intervals in which the entire length of the interval was present on 

more than one sequence in the cluster were identified, and intervals thus identified were considered to 
be equivalent by transitivity. For example, if an interval was present on a cDNA and two genomic 
sequences, then all three intervals were considered to be equivalent. This process allows unrelated 
but consecutive genomic sequences to be brought together, bridged by cDNA sequence. Intervals 
15 thus identified were then "stitched" together by die stitching algorithm in the order that they appear 
along thek parent sequences to generate the longest possible sequence, as well as sequence variants. 
Linkages between intervals which proceed along one type of parent sequence (cDNA to cDNA or 
genomic sequence to genomic sequence) were given preference over linkages which change parent 
type (cDNA to genomic sequence). The resultant stitched sequences were translated and compared 
20 by BLAST analysis to the genpept and gbpri public databases, hicorrect exons predicted by Genscan 
were corrected by comparison to die top BLAST hit from genpept. Sequences were further extended 
with additional cDNA sequences, or by inspection of genomic DNA, when necessary. 
"Stretched" SBq iiftnrP« 

Partial DNA sequences were extended to full length with an algorithm based on BLAST 
25 analysis. First, partial cDNAs assembled as described in Example ffl were queried against public 
databases such as the GenBank primate, rodent, mammalian, vertebrate, and eukaryote databases 
usmg the BLAST program. The nearest GenBank protein homolog was then compared by BLAST 
analysis to either Incyte cDNA sequences or GenScan exon predicted sequences described in 
Example IV. A chimeric protein was generated by using the resultant high-scoring segment pairs 
(HSPs) to map the translated sequences onto the GenBank protein homolog. Insertions or deletions 
may occur in the chimeric protem with respect to the original GenBank protein homolog. The 
GenBank protem homolog. the chimeric protein, or both were used as probes to search for 
homologous genomic sequences from the public human genome databases. Partial DNA sequences 
were therefore "stretched" or extended by the addition of homologous genomic sequences. The 
resultant stretched sequences were examined to determine whether it contained a complete gene. 
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VI. Chromosomal Mapping of CGDD Encoding Polynucleotides 

The sequences which were used to assemble SEQ ED NO:34-66 were compared with 
sequences from the Incyte LIFESEQ database and public domain databases using BLAST and other 
implementations of the Smith-Waterman algorithm. Sequences from these databases that matched 
5 SEQ ID NO:34-66 were assembled into clusters of contiguous and overlapping sequences using 
assembly algorithms such as Phrap (Table 7). Radiation hybrid and genetic mapping data available 
from public resources such as the Stanford Human Genome Center (SHGC), Whitehead Institute for 
Genome Research (WIGR), and G^nethon were used to detemiine if any of the clustered sequences 
had been previously mapped. Inclusion of a mapped sequence in a cluster resulted in the assignment 
10 of all sequences of tliat cluster, including its particular SEQ ID NO:, to that map location. 

Map locations are represented by ranges, or intervals, of human chromosomes. The map 
position of an mterval, in centiMorgans, is measured relative to the terminus of the chromosome's p- 
aim. (The centiMorgan (cM) is a unit of measurement based on recombination frequencies between 
chromosomal markers. On average, 1 cM is roughly equivalent to 1 megabase (Mb) of DNA in 
15 humans, although this can vary widely due to hot and cold spots of recombination.) The cM 

distances are based on genetic markers mapped by Gen^thon which provide boundaries for radiation 
hybrid markers whose sequences were included ui each of the clusters. Human genome maps and 
other resources available to the public, such as the NCBl "GeneMap'99" World Wide Web site 
(http://www.ncbi.nhn.nih.gov/genemap/), can be employed to determine if previously identified 
20 disease genes map within or m proximity to the intervals indicated above. 
Associarion of COn p Polvnucleotides with Parkinson's DiseasR 

Several genes have been identified as showing linkage to autosomal dominant forms of 
Parkinson's Disease (PD). PD is a common neurodegenerative disorder causing hradyMnesia, 
restmg tremor, muscular rigidity, and postural instabiUty. Cytoplasmic eosinophiHc inclusions 
25 called Lewy bodies, and neuronal loss especially in the substantia nigra pars compacta, are 

pathological hallmarks of PD (Valente, EM. et al (2001) Am. J. Hum. Genet. 68:895-900). Lewy 
body Parkinson disease has been thought to be a specific autosomal dominant disorder 
(Wakabayashi, K. et al. (1998) Acta Neuropath. 96:207-210). Juvenile parkinsonism may be a 
specific autosomal recessive disorder (Matsumine, H. et al. (1997) Am. J. Hum. Genet. 60: 
588-596, 1997). (Online Mendelian Inheritance in Man, OMIM. Johns Hopkins University, 
Baltimore, MD. MM Number: 168600: Sept. 9, 2002: . World Wide Web URL: 
http://www.ncbi.nlm.nih.gov/omim/) 

Association of a disease with a chromosomal locus can be determined by lod score. Lod 
score is a statistical method used to test the linkage of two or more loci within families having a 
genetic disease. The lod score is the logarithm to base 10 of the odds in favor of linkage. 
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Linkage is defined as the tendency of two genes located on the same chromosome to be inherited 
together through meiosis {Genetics in Medicine, Fifth Edition, (1991) Thompson, M.W. et al., 
W.B. Saunders Co. Philadelphia). A lod score of +3 or greater (1000:1 odds in favor of linkage) 
indicates a probability of 1 in 1000 that a particular marker was found solely by chance in affected 
5 individuals, which is strong evidence that two genetic loci are linked. 

One such gene implicated in PD is PARK3, which maps to 2pl3 (Gasser, T. et al. (1998) 
Nature Genet. 1 8:262-265). A marker at chromosomal position D2S441 was found to have a lod 
score of 3.2 in die region of PARKS. This marker supported the disease association of PARK3 
in the chromosomal interval from D2S134 to D2S286 (Gasser et al., su^ra). Markers located 
10 within chromosomal intervals D2S134 and D2S286, which map between 83.88 to 94.05 
centiMorgans on the short arm of chromosome 2, were used to identify genes that map in the 
region between D2S134 and D2S286. 

A second PD gene, implicated in early-onset recessive parkinsonism, is PARK6, located 
on chromosome 1 at Ip35-lp36. Several markers were obtained with lod scores greater than 3 
15 including D1S199, D1S2732. D1S2828, D1S478, D1S2702, D1S2734, D1S2674 (Valente, E.M. 
et al, suEm). These markers were used to determine the PD-relevant range of chromosome loci 
and identify sequences that map to chromosome 1 between D1S199 and D1S2885. 

Restriction fragment length polymorphism (RFLP) markers shown to be near regions of DNA 
known as sequence-tagged sites (STS). have been mapped to NT_Contigs generated by the Human 
20 Genome Project using ePCR (Schuler, G.D. (1997) Genome Research 7: 541-550, and (1998) Trends 
Biotechnol. 16(1 1):456-9). Contigs containing regions of DNA with known disease-associated 
markers are therefore used to identify CGDD sequences that map to disease-associated regions of the 
genome. 

Polynucleotides encoding CGDD were mapped to NT_Contigs. Contigs longer than 1Mb 
25 were broken into subcontigs of 1Mb length with overlaping sections of lOOkb. A preliminary step 
used an algorithm, similar to MEGABLAST, to define the mRNA sequence /masked genomic DNA 
contig pairings. The cDNA/genomic pairings identified by the first algorithm were confirmed, and 
the CGDD polynucleotides mapped to DNA contigs, using SIM4 (Horea, L. et al. (1998) Genome 
Res. 8:967-74, version May 2000) which had been optimized for high throughput processing and 
strand assignment confidence). The SIM4 output of the mRNA sequence/genomic contig pairs was 
further processed to determine the correct location of the CGDD polynucleotides on the genomic 
contig, as well as their strand identify. 

Both SEQ ID NO:38 and SEQ ID NO:48 were mapped to NT_Contig NT_004782 from 
Genbank release February. 2002. covering a 14.87 Mb region of the genome that also contains PD- 
associated genetic markers D1S199 and D1S2885. The maximum distance between both SEQ ID 
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NO:38 and SEQ ID NO:48 and markers D1S199 and D1S2885, therefore, is 14.87 Mb. Thus, both 
SEQ ID NO:38 and SEQ ID NO:48 are in proximity with genetic markers shown to consistently 
associate with PD. 

Vn. Analysis of Polynucleotide Expression 

5 Northern analysis is a laboratory technique used to detect the presence of a transcript of a 

gene and involves the hybridization of a labeled nucleotide sequence to a membrane on which RNAs 
from a particular cell type or tissue have been bound (Sambrook and Russell, supra, ch. 7; Ausubel et 
al., supra, ch. 4). 

Analogous computer techniques applying BLAST were used to search for identical or related 
10 molecules in databases such as GenBank or LIFESEQ (Incyte Genonaics). This analysis is much 
faster than multiple membrane-based hybridizations, hi addition, the sensitivity of the computer 
search can be modified to determine whether any particular match is categorized as exact or similar. 
The basis of the search is the product score, which is defined as: 

15 BLAST Score x Percent Identity 

5 x minimum {length(Seq. 1), lengdi(Seq. 2)} 

The product score takes into account both the degree of similarity between two sequences and the 
length of the sequence match. The product score is a normalized value between 0 and 100, and is 

20 calculated as foUows: the BLAST score is multiplied by the percent nucleotide identity and the 
product is divided by (5 times the length of the shorter of the two sequences). The BLAST score is 
calculated by assigning a score of +5 for every base that matches in a high-scoring segment pair 
(HSP), and -4 for every mismatch. Two sequences may share more than one HSP (separated by 
gaps). If there is more than one HSP, then the pair with the highest BLAST score is used to calculate 

25 the product score. The product score represents a balance between fractional overlap and quality in a 
BLAST alignment. For example, a product score of 100 is produced only for 100% identity over the 
entire length of the shorter of the two sequences being compared. A product score of 70 is produced 
either by 100% identity and 70% overlap at one end, or by 88% identity and 100% overlap at the 
other. A product score of 50 is produced either by 100% identity and 50% overlap at one end, or 79% 

30 identity and 100% overlap. 

Altematively, polynucleotides encoding CGDD are analyzed with respect to the tissue 
sources from which they were derived. For example, some full length sequences are assembled, at 
least in part, with overlapping Incyte cDNA sequences (see Example M). Each cDNA sequence is 
derived from a cDNA library constructed from a human tissue. Each human tissue is classified into 

35 one of the following organ/tissue categories: cardiovascular system; connective tissue; digestive 
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system; embryonic structures; endocrine system; exocrine glands; genitalia, female; genitalia, male; 
gem cells; hemic and inmiune system; liver; musculoskeletal system; nervous system; pancreas; 
respiratory system; sense organs; skin; stomatognathic system; unclassified/mixed; or urinary tract. 
The number of libraries in each category is counted and divided by the total number of libraries 
5 across all categories. Similarly, each human tissue is classified into one of the following 

disease/condition categories: cancer, cell line, developmental, inflammation, neurological, trauma, 
cardiovascular, pooled, and other, and the number of libraries in each category is counted and divided 
by the total number of libraries across all categories. The resulting percentages reflect the tissue- and 
disease-specific expression of cDNA encoding CGDD. cDNA sequences and cDNA library/tissue 

10 information are found in the LIFESEQ GOLD database (Mcyte Genomics, Palo Alto CA). 
Vni. Extension of CGDD Encoding Polynucleotides 

Full length polynucleotides are produced by extension of an appropriate fragment of the full 
length molecule using oUgonucleotide primers designed from this fragment. One primer was 
synthesized to initiate 5' extension of the known fragment, and the other primer was synthesized to 

15 initiate 3' extension of the known fragment. The initial primers were designed usmg OLIGO 4.06 
software (National Biosciences), or another appropriate program, to be about 22 to 30 nucleotides in 
length, to have a GC content of about 50% or more, and to anneal to the target sequence at 
temperatures of about 68 °C to about 72 °C. Any stretch of nucleotides which would result in hairpin 
structures and primer-primer dimerizations was avoided. 

20 Selected human cDNA libraries were used to extend the sequence. If more than one 

extension was necessary or desired, additional or nested sets of primers were designed. 

High fidelity amplification was obtained by PGR usmg methods well known m the art. PGR 
was performed in 96-well plates usmg the PTC-200 thermal cycler (MJ Research, Inc.). The reaction 
mix contained DNA template, 200 nmol of each primer, reaction buffer containing Mg^*, {NH4)2S04, 

25 and 2-mercaptoethanol, Taq DNA polymerase (Amersham Biosciences), ELONGASE enzyme 

(Invitrogen), and Pfti DNA polymerase (Stratagene), with the following parameters for primer pair 
PCI A and Pa B: Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 mm; Step 4; 68°C, 2 
niin; Step 5: Steps 2, 3, and 4 repeated 20 times; Step 6: 68°C, 5 min; Step 7: storage at 4°C. In the 
alternative, the parameters for primer pair T7 and SK+ were as follows: Step 1: 94°C, 3 min; Step 2: 

30 94°C, 15 sec; Step 3: 57^, 1 min; Step 4: eS^C, 2 min; Step 5: Steps 2, 3, and 4 repeated 20 times; 
Step 6: 68 °C, 5 mm; Step 7: storage at 4''C. 

The concentration of DNA m each well was detennined by dispensing 100 jul PICOGREEN 
quantitation reagent (0.25% (v/v) PICOGREEN; Molecular Probes, Eugene OR) dissolved in IX TE 
and 0.5 /tl of undiluted PGR product into each well of an opaque fluorimeter plate (Coming Costar, 

35 Acton MA), allowing the DNA to bind to the reagent. The plate was scanned in a Fluoroskan U 
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(Labsystems Oy. Helsinki, Finland) to measure the fluorescence of the sample and to quantify the 
concentration of DNA. A 5 mI to 10 u.\ aliquot of the reaction mixture was analyzed by 
electrophoresis on a 1 % agarose gel to determine which reactions were successful in extending the 
sequence. 

5 The extended nucleotides were desalted and concentrated, transferred to 384-well plates, 

digested with CviJI cholera virus endonuclease (Molecular Biology Research, Madison Wl), and 
sonicated or sheared prior to religation into pUC 18 vector (Amersham Biosciences). For shotgun 
sequencing, the digested nucleotides were separated on low concentration (0.6 to 0.8%) agarose gels, 
firagments were excised, and agar digested with Agar ACE (Promega). Extended clones were 
10 religated using T4 ligase (New England Biolabs, Beverly MA) into pUC 18 vector (Amersham 

Biosciences), treated with Pfii DNA polymerase (Stratagene) to fill-in restriction site overfiangs, and 
transfected into competent E. coli ceUs. Transformed cells were selected on antibiotic-containmg 
media, and individual colonies were picked and cultured overnight at 37°C m 384-weil plates in 
LB/2x carb liquid media. 

The cells were lysed, and DNA was anq)lified by PGR using Taq DNA polymerase 
(Amersham Biosciences) and Pfu DNA polymerase (Stratagene) with the following parameters: Step 
1: 94°C. 3 min; Step 2: 94X, 15 sec; Step 3: 60°C. 1 min; Step 4: 72°C, 2 min; Step 5: steps 2, 3, 
and 4 repeated 29 times; Step 6: 72»C. 5 min; Step 7: storage at 4°C. DNA was quantified by 
PICOGREEN reagent (Molecular Probes) as described above. Samples with low DNA recoveries 
were reamplified using the same conditions as described above. Samples were dUuted with 20% 
dimethysulfoxide (1:2. v/v). and sequenced using DYENAMIC energy transfer sequencing primers 
and the DYENAMIC DIRECT kit (Amersham Biosciences) or the ABI PRISM BIGDYE Terminator 
cycle sequencing ready reaction kit (Applied Biosystems). 

In like manner, fiill length polynucleotides are verified using the above procedm-e or are used 
to obtain 5' regulatory sequences using the above procedure along with oligonucleotides designed for 
such extension, and an appropriate genomic library. 

IX. Identification of Single Nucleotide Polymorphisms in CGDD Encoding Polynucleotides 

Common DNA sequence variants known as single nucleotide polymorphisms (SNPs) wer^ 
identified in SEQ ID NO:34-66 usmg the LIFESEQ database (Incyte Genomics). Sequences from the 
same gene were clustered together and assembled as described in Example HI, aUowing the 
identification of all sequence variants in the gene. An algorithm consisting of a series of filters was 
used to distinguish SNPs from other sequence variants. Preliminary filters removed the majority of 
basecall errors by requiring a mmimum Phied quality score of 15, and removed sequence alignment 
errors and errors resulting fiiom improper trimming of vector sequences, chimeras, and splice 
variants. An automated procedure of advanced chromosome analysis analysed the original 
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chromatogram files in the vicinity of the putative SNP. Clone error filters used statistically generated 
algorithms to identify errors introduced during laboratory processing, such as those caused by reverse 
transcriptase, polymerase, or somatic mutation. Qustering error filters used statistically generated 
algorithms to identify errors resulting from clustering of close homologs or pseudogenes, or due to 
5 contamination by non-human sequences. A final set of filters removed duplicates and SNPs found in 
immunoglobulins or T-cell receptors. 

Certain SNPs were selected for further characterization by mass spectrometry using the high 
throughput MASSARRAY system (Sequenom, Inc.) to analyze allele frequencies at the SNP sites m 
four different human populations. The Caucasian population comprised 92 individuals (46 male, 46 
10 female), including 83 from Utah, four French, three Venezualan, and two Amish individuals. The 
African population comprised 194 individuals (97 male, 97 female), all African Americans. The 
Hispanic population comprised 324 individuals (162 male, 162 female), all Mexican Hispanic. The 
Asian population comprised 126 individuals (64 male, 62 female) with a reported parental breakdown 
of 43% Chinese, 31% Japanese, 13% Korean, 5% Vietnamese, and 8% other Asian. ADele 
15 frequencies were fiorst analyzed in the Caucasian population; m some cases fliose SNPs which showed 
no allelic variance in this population were not further tested in the other three populations. 
X. Labeling and Use of Individual Hybridization Probes 

Hybridization probes derived from SEQ ID NO:34-66 are enq)loyed to screen cDNAs, 
genomic DNAs, or mRNAs. Although the labeling of oligonucleotides, consisting of about 20 base 
20 pairs, is specifically described, essentially the same procedure is used with larger nucleotide 
fragments. Oligonucleotides are designed using state-of-the-art software such as OLIGO 4.06 
software (National Biosciences) and labeled by combining 50 pmol of each oligomer, 250yuCi of 
[y-^^P] adenosine triphosphate (Amersham Biosciences), and T4 polynucleotide kinase (DuPont MEN, 
Boston MA). The labeled oligonucleotides are substantially purified using a SEPHADEX G-25 
25 superfine size exclusion dextran bead column (Amersham Biosciences). An aliquot contaming 10^ 
counts per minute of the labeled probe is used m a typical membrane-based hybridization analysis of 
human genomic DNA digested with one of the following endonucleases: Ase I, Bgl n, Eco RI, Pst I, 
Xba I, or Pvu n (DuPont NEN). 

The DNA from each digest is fractionated on a 0.7% agarose gel and transferred to nylon 
30 membranes (Nytran Plus, Schleicher & Schuell, Durham NH). Hybridization is carried out for 16 
hours at 40 °C. To remove nonspecific signals, blots are sequentially washed at room temperature 
under conditions of up to, for example, 0. 1 x saline sodium citrate and 0.5% sodium dodecyl sulfate. 
Hybridization pattems are visualized using autoradiography or an alternative imaging uMans and 
compared. 
35 XI. Microarrays 
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The linkage or synthesis of array elements upon a microarray can be achieved utilizing 
photolithography, piezoelectric printing (ink-jet printing; see, e.g., Baldeschweiler et al, supra), 
mechanical microspotting technologies, and derivatives thereof. The substrate in each of the 
aforementioned technologies should be uniform and solid with a non-porous surface (Schena, M., ed. 
^^^^^^ DNAMicroarravs: A Practical Ap prn^rh, Oxford University Press, London). Suggested 
substrates include silicon, siUca, glass slides, glass chips, and silicon wafers. Alternatively, a 
procedure analogous to a dot or slot blot may also be used to arrange and link elements to the surface 
of a substrate using thermal, UV, chemical, or mechanical bonding procedures. A typical array may 
be produced using available methods and machines weU known to those of ordinary skill in the art 
and may contain any appropriate number of elements (Schena, M. et al. (1995) Science 270:467^70; 
Shalon, D. et al. (1996) Genome Res. 6:639-645; Marshall, A. and J. Hodgson (1998) Nat. 
Biotechnol. 16:27-31). 

Full length cDNAs, Expressed Sequence Tags (ESTs), or fragments or oligomers thereof may 
comprise the elements of the microarray. Fragments or oligomers suitable for hybridization can be 
selected using software well known in the art such as LASERGENE software (DNASTAR). The 
array elements are hybridized with polynucleotides in a biological sample. The polynucleotides in the 
biological sample are conjugated to a fluorescent label or other molecular tag for ease of detection. 
After hybridization, nonhybridized nucleotides from the biological sample are removed, and a 
fluorescence scanner is used to detect hybridization at each array element. Alternatively, laser 
desorbtion and mass spectrometry may be used for detection of hybridization. The degree of 
complementarity and the relative abundance of each polynucleotide which hybridizes to an element 
on the microarray may be assessed. In one embodiment, microarray preparation and usage is 
described in detail below. 
Tissue or Cell Sample Preparation 

Total RNA is isolated from tissue samples using the guanidmium thiocyanate method and 
poly(A)^ RNA is purified using the oUgo-(dT) ceUulose method. Each poly(A)^ RNA sample is 
reverse transcribed usmg MMLV reverse-transcriptase, 0.05 pg/^1 ohgo-(dT) primer (21mer), IX first 
strand buffer, 0.03 units//il RNase inhibitor, 500 dATP, 500 dGTP, 500 /iM dTTP, 40 
dCTP. 40 AIM dCTP-Cy3 (BDS) or dCTP-CyS (Amersham Biosciences). The reverse transcription 
reaction is performed in a 25 ml volume containing 200 ng poly(A)" RNA with GEMBRIGHT kits 
(Incyte Genomics). Specific control poly(A)- RNAs are synthesized by in vitro transcription from 
non-codmg yeast genomic DNA. After incubation at 37° C for 2 hr, each reaction sample (one with 
Cy3 and another with Cy5 labeling) is treated with 2.5 ml of 0.5M sodium hydroxide and incubated 
for 20 minutes at SS'C to the stop the reaction and degrade the RNA. Samples are purified using two 
successive CHROMA SPIN 30 gel filtration spin columns (Clontech, Palo Alto CA) and after 
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combining, both reaction samples are ethanol precipitated using 1 ml of glycogen (1 mg/ml), 60 ml 
sodium acetate, and 300 mi of 100% ethanol. The sample is then dried to completion using a 
SpeedVAC (Savant Instruments Inc., Holbrook NY) and resuspended in 14 nl 5X SSC70.2% SDS. 
Microarrav Preparation 
5 Sequences of the present invention are used to generate array elements. Each array element 

is amplified from bacterial cells containing vectors with cloned cDNA inserts. PGR amplification 
uses primers complementary to the vector sequences flanking the cDNA insert. Array elements are 
amplified in thirty cycles of PGR from an initial quantity of 1-2 ng to a final quantity greater than 5 
lig. Amplified array elements are then purified using SEPHACRYL-400 (Amersham Biosciences). 

10 Purified array elements are immobilized on polymer-coated glass slides. Glass microscope 

slides (Goining) are cleaned by ultrasound in 0.1% SDS and acetone, with extensive distilled water 
washes between and after treatments. Glass slides are etched in 4% hydrofluoric acid (VWR 
Scientific Products Corporation (VWR), West Ghester PA), washed extensively in distilled water, and 
coated with 0.05% aminopropyl silane (Sigma) in 95% ethanol. Goated slides ate cured in a 110 "C 

15 oven. 

Array elements are applied to the coated glass substrate using a procedure described in U.S. 
Patent No. 5,807,522, incorporated herein by reference. 1 ftl of the array element DNA, at an average 
concentration of 100 ngZ/il, is loaded into the open capillary printing element by a high-speed robotic 
apparatus. The apparatus then deposits about 5 nl of array element sample per slide. 

20 Microarrays are UV-crosslinked using a STRATALINKER UV-crosslinker (Stratagene). 

Microarrays are washed at room temperature once in 0.2% SDS and three times in distilled water. 
Non-specific binding sites are blocked by incubation of microarrays in 0.2% casein in phosphate 
buffered saline (PBS) (Tropix, Inc., Bedford MA) for 30 minutes at 60°G followed by washes in 
0.2% SDS and distilled water as before. 

2S Hybridization 

Hybridization reactions contain 9 fil of sample mixture consistmg of 0.2 fig each of Gy3 and 
Gy5 labeled cDNA syndiesis products in 5X SSC, 0.2% SDS hybridization buffer. The sample 
mixture is heated to 65° C for 5 minutes and is aliquoted onto the microarray surface and covered with 
an 1 .8 cm^ coverslip. The arrays are transferred to a waterproof chamber having a cavity just slightly 
30 larger than a microscope slide. The chamber is kept at 100% humidity mtemally by the addition of 
140 jxl of 5X SSC in a comer of the chamber. The chamber containing the arrays is incubated for 
about 6.5 hours at 60° C. The arrays are washed for 10 min at 45° C in a iirst wash buffer (IX SSC, 
0.1% SDS), three times for 10 minutes each at 45° C in a second wash buffer (O.IX SSC), and dried. 
Detection 

35 Reporter-labeled hybridization complexes are detected with a microscope equipped with an 
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mnova 70 mixed gas 10 W laser (Coherent, Inc., Santa Clara CA) capable of generating spectral lines 
at 488 nm for excitation of Cy3 and at 632 om for excitation of Cy5. The excitation laser light is 
focused on the array using a 20X microscope objective (Nikon, Inc., MelviUe NY). The slide 
contaming the array is placed on a computer-controlled X-Y stage on the microscope and xaster- 
5 scanned past the objective. The 1.8 cm x 1.8 cm array used in the present example is scanned with a 
resolution of 20 micrometers. 

In two separate scans, a mixed gas multiline laser excites the two fluorophores sequentially. 
Emitted light is split, based on wavelength, into two photomultiplier tube detectors (PMT R1477 
Hamamatsu Photonics Systems, Bridgewater NJ) corresponding to the two fluorophores. Appropriate 
10 filters positioned between the array and the photomultiplier tubes are used to filter the signals The 
emission maxima of the fluorophores used are 565 nm for CyS and 650 mn for Cy5. Each aixay is 
typically scamied twice, one scan per fluorophore using the appropriate filters at die laser source 
although the apparatus is capable of recording the spectra from both fluorophores sunultaneously. 
The sensitivity of the scans is typically calibrated using the signal intensity generated by a 
15 cDNA control species added to the sample mixture at a known concentration. A specific location on 
the array contains a complementary DNA sequence, allowing the intensity of the signal at that 
location to be correlated witi, a weight ratio of hybridizing species of 1 : 100.000. When two samples 
from different sources (e.g., representing test and control cells), each labeled witii a diffei^t 
fluorophoie, are hybridized to a single array for the purpose of identifying genes that are 
differentially expressed, the calibration is done by labeling samples of the caUbrating cDNA with die 
two fluorophores and adding identical amounts of each to the hybridization mixture. 

The output of tiie photomultiplier tube is digitized using a 12-bit RTE-835H analog-to-digital 
(A/D) conversion board (Analog Devices, Inc., Norwood MA) mstalled in an IBM-compatible PC 
computer. The digitized data are displayed as an onage where ti,e signal intensity is mapped using a 
hnear 20-color transfonnation to a pseudocolor scale ranging from blue (low signal) to red (high 
signal). The data is also analyzed quantitatively. Where two different fluorophores are excited and 
measured simultaneously, tiie data are first corrected for optica] crosstalk (due to overlapping 
emission spectra) between the fluorophores using each fluorophore's emission spectmm. 

A grid is superimposed over die fluorescence signal unage such Aat the signal from each spot 
IS centered in each element of the grid. The fluorescence signal within each element is then 
mtegtated to obtain a numerical value corresponding to the average mtensity of the signal The 
software used for signal analysis is ti,e GEMTOOLS gene expression analysis program (Incyte 
Genomics). Array elements tiiat exhibit at least about a two-fold change in expression, a signal-to- 
background ratio of at least about 2.5. and an element spot size of at least about 40%. are considered 
35 to be differentially expressed. 
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Expression 

For example, SEQ ID NO:38 was significantly under-expressed in the breast tumor cell lines 
when compared to a normal breast cell Une. The gene expression profile of a nonmalignant mammary 
epithelial cell line was compared to the gene expression profiles of breast carcinoma Unes at different 
5 stages of tumor progression. Cell lines compared included: a) BT-20, a breast carcinoma cell line 
derived in vjtiQ from the cells emigrating out of thin slices of tumor mass isolated from a 74-year-old 
female, b) BT-474, a breast ductal carcinoma cell line that was isolated from a solid, invasive ductal 
carcinoma of the breast obtained from a 60-year-old woman, c) BT-483, a breast ductal carcinoma 
cell line that was isolated from a papiUary invasive ductal tumor obtained from a 23-year-old normal, 

10 menstruating, parous female with a family history of breast cancer, d) Hs 578T, a breast ductal 
carcinoma cell line isolated from a 74-year-old female with breast carcinoma, e) MCF7, a 
nonmalignant breast adenocarcinoma cell line isolated from the pleural effusion of a 69-year-old 
female, 0 MCF-lOA, a breast mammary gland (luminal ductal characteristics) cell line isolated, from 
a 36-year-old woman witii fibrocystic breast disease, g) MDA-MB-468, a breast adenocarcinoma ceU 

15 line isolated from the pleural effusion of a 5 1 -year-old female widj metastatic adenocarcinoma of the 
breast, and h) HMEC, a primary breast epithelial cell line isolated from a normal donor. For example, 
SEQ ID NO:38 was decreased at least 2.3-fold in five (BT-474, BT-483, MCF-lOA, MCF7, and BT- 
20) of the seven ceU lines tested (described above) when compared with HMEC cells whether grown 
in medium with or without growth factors and nutrients. Therefore, in various embodunents, SEQ ID 
20 NO:38 cao be used for one or more of the following: i) monitoring treatment of breast cancer, ii) 
diagnostic assays for breast cancer, and iii) developing therapeutics and/or other treatments for breast 



cancer. 



As with most tumors, prostate cancer develops through a multistage progression ultimately 
resulting in an aggressive tumor phenotype. The initial step in tumor progression mvolves the 

25 hyperproliferation of normal luminal and/or basal epithehal cells. Androgen responsive ceUs become 
hyperplastic and evolve into early-stage tumors. Although early-stage tumors are often androgen 
sensitive and respond to androgen ablation, a population of androgen independent cells evolve from 
the hyperplastic population. These cells represent a more advanced form of prostate tumor that may 
become invasive and potentially become metastatic to the bone, brain, or lung. PrEC is a primary 

30 prostate epitheUal ceU line isolated from a normal donor. DU 145 is a prostate carcinoma cell Une 
isolated from a metastatic site in the brain of 69-year old male with widespread metastatic prostate 
carcinoma. DU 145 has no detectable sensitivity to honnones; forms colonies in semi-solid medium; 
is only weakly positive for acid phosphatase; and ceUs are negative for prostate specific antigen 
(PSA). LNCaP is a prostate carcinoma cell Une isolated from a lymph node biopsy of a 50-year-old 

35 male with metastatic prostate carcinoma. LNCaP cells express prostate specific antigens, produce 
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prostatic acid pliosphatase, and express androgen receptors. PC-3 is a prostate adenocarcinoma cell 
line that was isolated from a metastatic site in the bone of a 62- year-old male with grade IV prostate 
adenocarcinoma. Starved: cells were grown in basal media in the absence of growth factors and 
hormones. In a further example, expression of Seq ID NO:38 was decreased at least two-fold in two 
5 (DU145 and LNCaP) of the three cell lines tested (described above) when compared with PrEC cells. 
Therefore, in various embodiments, SEQ ID NO:38 can be used for one or more of the following: i) 
monitoring treatment of prostate cancer, ii) diagnostic assays for prostate cancer, and iii) developing 
therapeutics and/or other treatments for prostate cancer. 
Preparation of Monocytes and Immature Dendritic Cells 
10 Peripheral blood mononuclear cells (PBMCs) were isolated from freshly obtained peripheral 

blood of four healthy donors by centrifugation of the lymphocyte enriched blood fraction over a 
PIYPAQUE ficoll gradient (Sigma-Aldrich). The PBMCs were allowed to adhere to plastic in 
Iscove's Modified Dulbecco's Medium supplemented with 10% fetal bovine serum, 200 nM 
glutamine, and 200 nM each penicillin and streptomycin for 4 hours to separate monocytes from other 
15 nonadherent cells. One-half of the monocytes was cultured for seven days in 10 ng/ml GM-CSF 
(Peprotech Inc., Rocky Hill NJ), and the other half was cultured for seven days with 10 ng/ml GM- 
CSF and 10 ng/ml IL-4 (Peprotech Inc.) to produce immature dendritic ceUs. 
Preparation of Mature Dendritic Cells 

Monocytes were isolated as described above and were incubated with 10 ng/ml GM-CSF and 
20 10 ng/ml IL-4 for 13 days to generate immature DC. The DC were activated with anti-CD40 

(Biodesign International, Kennebunk ME) at 10 ng/ml for 24 hours. The DC were divided into half, 
and each half further divided into thirds. Each third was cultured with soluble human TRANCE 
protein (Peprotech Inc.) at 10 ng/ml for 2, 8, and 24 hours, respectively, or cultured without TRANCE 
for 2, 8, and 24 hours, respectively. In yet a further example, SEQ ID NO:38 was mcreased at least 
25 8.5-fold in dendritic ceUs. Therefore, in various embodiments, SEQ ID NO:38 can be used for one or 
more of the following: i) monitoring treatment of unmune disorders and related diseases and 
conditions, ii) diagnostic assays for immune disorders and related diseases and conditions, and iii) 
developing therapeutics and/or other treatments for immune disorders and related diseases and 
conditions. 

30 In another example, the expression of SEQ ID NO:40 was increased at least four-fold in 

cancerous lung tissue compared to normal tissue from the same donor. Lung squamous cell 
carcinoma tissue was obtained from a 68-year-old female and matched with normal lung tissue 
obtained from grossly uninvolved tissue fixim the same donor. Therefore, in various embodiments, 
SEQ ID NO:40 can be used for one or more of the following: i) monitoring treatment of lung 

35 squamous ceU carcinoma, u) diagnostic assays for lung squamous cell carcinoma, and iii) developing 
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therapeutics and/or other treatments for lung squamous cell carcinoma. 

In another example, the expression of SEQ ID NO:42 was decreased at least two-fold in 
cancerous lung tissue compared to normal tissue from the same donor. Lung squamous cell 
carcinoma tissue was obtained from a 75-year-old female and matched with normal lung tissue 
obtained from grossly uninvolved tissue from the same donor. Therefore, in various embodiments, 
SEQ ID NO:42 can be used for one or more of the following: i) monitoring treatment of lung 
squamous cell carcinoma, ii) diagnostic assays for lung squamous cell carcmonaa, and iii) developing 
therapeutics and/or other treatments for lung squamous cell carcinoma. Matched normal and 
tumorigenic lung tissue samples for SEQ ID NO:40 and SEQ ID NO:42 are provided by the Roy 
Castle Ljtemationai Centre for Lung Cancer Research, Liverpool UK). 

In another example, Jurkat is an acute T cell leukemia cell line that grows actively in the 
absence of external stimuli. Jurkat has been extensively used to study signaling in human T cells. 
PMA is a broad activator of the protein kinase C-dependent pathways. lonomycin is a calcium 
ionophore that permits the entry of calcium in the cell, hence increasing the cytosolic calcium 
concentration. The combination of PMA and ionomycui activates two of the major signaling 
pathways used by mammalian cells to interact with their environment. In T cells, the combination of 
PMA and ionomycin mimics the type of secondary signaling events elicited during optimal B cell 
activation. The expression of SEQ ID NO:42 was downregulated at least two-fold m the Jurkat T-cell 
leukemia cell line that had been stimulated for one hour with 1/iM PMA (phorbol 12-myristate 
13-acetate) and with ionomycin concentrations varying between 50 ng/ml and 10 ^g/ml when 
compared to untreated Jurkat cells in the absence of stimuli. Therefore, in various embodiments, SEQ 
ID NO:42 can be used for one or more of the following: i) monitoring treatment of T cell leukemia, ii) 
diagnostic assays for T cell leukemia and in signalmg assays in human T cells, and iii) developing 
therapeutics and/or other treatments for T cell leukemia. 

In a further example, SEQ ID NO:42 was decreased at least two-fold in one of seven cell lines 
(HMEC, MCF7, MDA-mb-231, Sk-BR-3, MDA-mb-435S, BT-20. and T^TD) when conq)ared with 
MCFIOA cells (See descriptions above.). Therefore, in various embodiments, SEQ ID NO:42 can be 
used for one or more of the following: i) monitoring treatment of breast cancer, ii) diagnostic assays 
for breast cancer, and iii) developing therapeutics and/or other treatments for breast cancer. 

In another example, expression of SEQ ID NO:47 was decreased at least two-fold in four out 
of five squamous cell carcinoma tissues and m one lung adenocarcinoma as determined by microartay 
analysis. Normal lung tissue from the donors was compared to lung tumor from the same donor. 
Donors are as follows: lung squamous cell carcinoma tissue was obtained from a 68-year-old female, 
a 66-year-old male, and two different 73-year-old males; lung adenocarcinoma tissue was obtained 
from a 66-year-oId female. Matched normal and tumorigenic lung tissue samples for SEQ ID NO:47 
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are provided by tlie Roy Castle Intemational Centre for Lung Cancer Research, Liverpool UK). 
Therefore, in various embodiments, SEQ TD NO:47 can be used for one or more of the following: i) 
monitoring treatment of lung cancer and other cell proliferative disorders, ii) diagnostic assays for 
lung cancer and other cell proliferative disorders, and iii) developing therapeutics and/or other 
5 treatments for lung cancer and other cell proliferative disorders. 

Human peripheral blood mononuclear cells (PBMC) can be classified into discrete cellular 
populations representing the major cellular components of the immune system. PBMCs contain about 
52% lymphocytes (12% B lymphocytes, 40 % T lymphocytes {25 % CD4+ and 15 % CD8+}), 20% 
NK cells, 25% monocytes, and 3% various other cells including dendritic ceUs and progenitor cells. 
10 The proportions, as well as the biology of these cellular components tend to slightly vary between 
healthy individuals, depending on factors such as age, gender, past medical history, and genetic 
background. PBMC from the blood of 6 healthy volunteer donors were incubated for 24 hours in the 
presence of graded doses of beclomethasone dissolved in DMSO or in DMSO alone. The treated 
PBMC were compared to matching untreated PBMC maintained in culture for the same duration. In a 
15 further example, expression of SEQ ID NO:48 was down regulated at least two-fold in PBMC cells 
treated with doses equal to or higher than 1 ^M (1 /xM, 5 ^M, and 25 fiM) of beclomethasone as 
determined by microarray analysis. Therefore, in various embodmients, SEQ ID NO:48 can be used 
for one or more of the following: i) monitoring treatment of immune disorders and related diseases 
and conditions, ii) diagnostic assays for immune disorders and related diseases and conditions, and 
20 iii) developing therapeutics and/or other treatments for immune disorders and related diseases and 
conditions. 

SEQ ID NO:58 was differentiaUy expressed in human peripheral blood mononuclear ceUs 
(PBMCs) treated with SEB (Staphylococcal enterotoxin B), when compared to untreated PBMC cells. 
PBMCs can be classified mto discrete ceUular populations representing the major cellular 

25 components of the immune system. PBMCs contain about 52% lymphocytes (12% B lymphocytes, 
40% T lymphocytes {25% CD4+ and 15% CD8+}), 20% NK ceUs, 25% monocytes, and 3% various 
cells that include dendritic cells and progenitor cells. The proportions, as well as the biology of these 
cellular components tend to vary slightly between healthy mdividuals, depending on factors such as 
age, gender, past medical history, and genetic background. 

30 Staphylococcal exotoxins specificaUy activate human T cells, expressing an appropriate TCR- 

Vbeta chain. Although polyclonal in nature, T cells activated by Staphylococcal exotoxins require 
antigen presenting cells (APCs) to present the exotoxin molecules to the T cells and dehver the 
costimulatoiy signals required for optimum T cell activation. Although, Staphylococcal exotoxms 
must be presented to T cells by APCs, these molecules are not required to be processed by APC. 

35 Indeed, Staphylococcal exotoxins duectly bind to a non-polymoiphic portion of the human MHC 
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class n molecules, bypassing the need to capture, cleavage, and binding of the peptides to the 
polymorphic antigenic groove of the MHC class II molecules. 

To evaluate the variation in gene expression in the PBMCs from healthy donors in response 
to treatment with SEB, the PBMCs from healthy volunteer donors were stimulated in vitro with SEE. 
SEQ ID NO:58 was overexpressed by at least two-fold in PBMC cells treated with SEB for 24 hours 
and for 72 hours. SEQ ID NO:58 was overexpressed by at least two-fold in PBMC cells treated with 
SEB for 72 hours. Therefore, in various embodiments, SEQ ID NO:58 can be used for one or mote of 
the following: i) monitoring treatment of immune disorders and related diseases and conditions, ii) 
diagnostic assays for immune disorders and related diseases and conditions, and iii) developing 
therapeutics and/or other treatments for inmiune disorders and related diseases and conditions. 

SEQ ID NO:58 was differentially expressed in human colon tissue compared to normal colon 
tissue. While soft tissue sarcomas are relatively rare, more than 50% of new patients diagnosed with 
the disease will die from it. The molecular pathways leading to the development of sarcomas are 
relatively unknown, due to the rarity of the disease and variation in pathology. It is likely that 
numerous gene expression differences exist between sarcomas and normal tissue. To begin to 
delineate the pathways that might lead to sarcoma formation, differences in gene expression were 
examined between a colon tumor derived from a gastric sarcoma and normal colon tissue from the 
same donor. SEQ ID NO:58 was overexpressed by at least two-fold in colon tumor tissue as 
compared to normal colon tissue from the same donor. Therefore, in various embodiments, SEQ ID 
NO:58 can be used for one or more of the following: i) monitoring tteatment of colon cancer, ii) 
diagnostic assays for colon cancer, and iii) developing therapeutics and/or other treatments for colon 
cancer. 

SEQ ID NO:53 and SEQ ID NO:58 were differentially expressed in human lung tissue 
compared to normal lung tissue. Lung cancers are divided mto four histopathologically distinct 
groups. Three groups (squamous cell carcinoma, adenocarcinoma, and large cell carcinoma) are 
classified as non-snoall cell lung cancers (NSCLCs). The fourth group of cancers is referred to as 
small ceU lung cancer (SCLC). Collectively. NSCLCs account for approximately 70% of cases while 
SCLCs account for approximately 18% of cases. The molecular and cellular biology underlying the 
development and progression of lung cancer are incompletely understood. Deletions on 
chromosomes are common in this disease and are thought to indicate the presence of a tumor 
suppressor gene m this region. Activating mutations in K-ras are commonly found in lung cancer and 
are the basis of one of the mouse models for the disease. Analysis of gene expression patterns 
associated with the development and progression of the disease wiU yield tremendous insight into the 
biology underlying this disease. Further, SEQ ID NO:53 and SEQ ID NO:58 were overexpressed by 
at least two-fold in two pair comparison experiments in which tissue from squamous cell carcinoma 
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of the lung was compared to uninvolved lung tissue, from the same donor. Therefore, in various 
embodiments, SEQ ID NO:53 and SEQ ID NO:58 can be used for one or more of the following: i) 
monitoring treatment of lung cancer, ii) diagnostic assays for lung cancer, and iu) developing 
therapeutics and/or other treatments for lung cancer. 

In a further example, the expression of SEQ ID NO:64 was increased at least two-fold in 
cancerous colon tissue compared to normal tissue from the same donor. Colon tissue was obtained 
from a 38-year-old male wjith invasive, poorly differentiated adenocarcinoma with metastases to two 
out of 13 lymph nodes surveyed, and matched with normal colon tissue obtained from grossly 
unmvolved colon tissue from the same donor. Matched normal and tumorigenic colon tissue samples 
are provided by the Huntsman Cancer Institute (Salt Lake City, UT). Therefore, in various 
embodiments, SEQ ID NO:64 can be used for one or more of the following: i) monitoring treatment 
of colon cancer, ii) diagnostic assays for colon cancer, and iii) developing therapeutics and/or other 
treatments for colon cancer. 
Xn. Complementary Polynucleotides 
15 Sequences complementary to the CGDD-encoding sequences, or any parts thereof, are used to 

detect, decrease, or inhibit expression of naturally occurring CGDD. Although use of 
oligonucleotides comprising from about 15 to 30 base pairs is described, essentially the same 
procedure is used with smaller or with larger sequence fragments. Appropriate oligonucleotides are 
designed using OLIGO 4.06 software (National Biosciences) and the codmg sequence of CGDD. To 
20 inhibit transcription, a complementaiy oligonucleotide is designed from the most unique 5* sequence 
and used to prevent promoter binding to the coding sequence. To mhibit translation, a 
complementary oligonucleotide is designed to prevent ribosomal binding to the CGDD-encoding 
transcript. 

Xm. Expression of CGDD 

25 Expression and purification of CGDD is achieved using bacterial or virus-based expression 

systems. For expression of CGDD in bacteria, cDNA is subcloned mto an appropriate vector 
containing an antibiotic resistance gene and an inducible promoter that dfrects high levels of cDNA 
transcription. Examples of such promoters include, but are not limited to, the trjy-lac (tac) hybrid 
promoter and the T5 or T7 bacteriophage promoter m conjunction with the lac operator regulatory 

30 element. Recombinant vectors are transformed into suitable bacterial hosts, e.g., BL21(DE3). 
Antibiotic resistant bacteria express CGDD upon induction with isopropyl beta-D- 
thiogalactopyranoside (IPTG). Expression of CGDD in eukaryotic cells is achieved by infecting 
insect or mammalian cell lines with recombinant Autographica califomica nuclear polyhedrosis vims 
(AcMNPV), commonly known as baculovirus. The nonessential polyhedrin gene of baculovirus is 

35 replaced with cDNA encoding CGDD by either homologous recombination or bacterial-mediated 
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transposition involving transfer plasraid intermediates. Viral infectivity is maintained and the strong 
polyhedrin promoter drives high levels of cDNA transcription. Recombinant baculovirus is used to 
infect Spodopterafrugiperda (Sf9) insect cells in most cases, or human hepatocytes, in some cases. 
Infection of the latter requires additional genetic modifications to baculovkus (Engelhard, E.K. et al. 
(1994) Proc. Natl. Acad. Sci. USA 91:3224-3227; Sandig, V. et al. (1996) Hum. Gene Ther. 7:1937- 
1945). 

In most expression systems, CGDD is synthesized as a fusion protein with, e.g., glutadiione 
S-transferase (GST) or a peptide epitope tag, such as FLAG or 6-His, permittmg rapid, smgle-step. 
affinity-based purification of recombinant fusion protein from crude ceU lysates. GST, a 26- 
kilodalton enzyme from Schistosor,mjaponicum, enables the purification of fusion proteins on 
immobilized glutathione under conditions that maintain protem activity and antigenicity (Amersham 
Biosciences). Following purification, the GST moiety can be proteolytically cleaved from CGDD at 
specificaUy engineered sites. FLAG, an 8-amino acid peptide, enables immunoaffinity purification 
using commerciaUy available monoclonal and polyclonal anfi-FLAG antibodies (Eastman Kodak). 6- 
His, a stretch of six consecutive histidine residues, enables purification on metal-chelate resins 
(QIAGEN). Methods for protem expression and purification are discussed m Ausubel et al. {supra, 
ch. 10 and 16). Purified CGDD obtained by these methods can be used directly in the assays shown 
in Examples XVH and XVDI, where applicable. 
XIV. Functional Assays 

CGDD function is assessed by expressing the sequences encoding CGDD at physiologically 
elevated levels in mammalian cell culture systems. cDNA is subcloned mto a mammalian expression 
vector containing a strong promoter that drives high levels of cDNA expression. Vectors of choice 
include PCMV SPORT plasmid (Invitrogen. Carlsbad CA) and PCR3.1 plasmid (Invitrogen). both of 
which contam the cytomegalovirus promoter. 5-10 ixg of recombinant vector are transiently 
transfected into a human cell line, for example, an endotheUal or hematopoietic cell Ime, usmg either 
liposome formulations or electtoporation. 1-2 ^g of an additional plasmid containing sequences 
encoding a marker protein are co-transfected. Expression of a marker protein provides a means to 
distinguish transfected cells from nontransfected ceUs and is a reliable predictor of cDNA expression 
from the recombinant vector. Marker proteins of choice include. e.g.. Green Huorescent Protein 
(GFP; Clontech). CD64. or a CD64-GFP fusion protem. Flow cytometry (FCM), an automated, laser 
optics-based technique, is used to identify fransfected ceUs expressmg GFP or CD64-GFP and to 
evaluate the apoptotic state of the cells and other cellular properties. FCM detects and quantifies the 
uptake of fluorescent molecules that diagnose events precedmg or coincident with ceU death. These 
events include changes m nuclear DNA content as measured by staining of DNA with propidium 
iodide; changes in cell size and granularity as measured by forward light scatter and 90 degree side 
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light scatter; down-regulation of DNA synthesis as measured by decrease in bromodeoxyuridine 
uptake; alterations in expression of cell surface and intracellular proteins as measured by reactivity 
with specific antibodies; and alterations in plasmia membrane composition as measured by the binding 
of fluorescein-conjugated Annexin V protein to the cell surface. Methods in flow cytometry are 
5 discussed in Ormerod, M.G. (1994; Flow Cytometry . Oxford, New York NY). 

The influence of CGDD on gene expression can be assessed using highly purified populations 
of cells transfected with sequences encoding CGDD and either CD64 or CD64-GFP. CD64 and 
CD64-GFP are expressed on the surface of transfected cells and bind to conserved regions of human 
immunoglobulin G (IgG). Transfected cells are efficiently separated firomnontransfected cells using 

10 magnetic beads coated with either human IgG or antibody against CD64 (DYNAL, Lake Success 
NY). mRNA can be purified from the cells using methods well known by those of skill in the art. 
Expression of mRNA encodmg CGDD and other genes of interest can be analyzed by northern 
analysis or microarray techniques. 
XV. Production of CGDD Specific Antibodies 

15 CGDD substantially purified usmg polyacrylamide gel electrophoresis (PAGE; see, e.g., 

Harrington, M.G. (1990) Methods Enzymol. 182:488-495), or other purification techniques, is used to 
immunize animals (e.g., rabbits, mice, etc.) and to produce antibodies using standard protocols. 

Altematively, the CGDD amino acid sequence is analyzed using LASERGENE software 
(DNASTAR) to determine regions of high inimunogenicity, and a corresponding oligopeptide is 

20 synthesized and used to raise antibodies by means known to those of skill in the art. Methods for 
selection of appropriate epitopes, such as those near the C-terminus or in hydrophilic regions are well 
described m the art (Ausubel et al., supra, ch. 1 1). 

Typically, oligopeptides of about 15 residues in length are syndiesized using an ABI 431 A 
peptide synthesizer (Applied Biosystems) using FMOC chemistry and coupled to KLH (Sigma- 

25 Aldrich, St. Louis MO) by reaction with N-maleimidobenzoyl-N-hydroxysuccinimide ester (MBS) to 
increase immunogenicity (Ausubel et al., supra). Rabbits are unmunized with the oUgopeptide-KLH 
complex in complete Freund's adjuvant. Resulting antisera are tested for antipeptide and anti-CGDD 
activity by, for example, binding the peptide or CGDD to a substrate, blocking with 1% BSA, reacting 
with rabbit antisera, washing, and reacting with radio-iodinated goat anti-rabbit IgG. 

30 XVI. Purification of Naturally Occurring CGDD Using Specific Antibodies 

Naturally occurring or recombinant CGDD is substantially purified by immunoaffinity 
chromatography usmg antibodies specific for CGDD. An immunoaffinity column is constructed by 
covalently coupling anti-CGDD antibody to an activated chromatographic resin, such as 
CNBr-activated SEPHAROSE (Amersham Biosciences). After the coupling, the resin is blocked and 

35 washed according to the manufacturer's instructions. 
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Media containing CGDD are passed over the inmiunoaffinity column, and the column is 
washed under conditions that allow the preferential absorbance of CGDD (e.g., high ionic strength 
buffers in the presence of detergent). The column is eluted under conditions that disrupt 
antibody/CGDD binding (e.g., a buffer of pH 2 to pH 3, or a high concentration of a chaotrope, such 
as urea or thiocyanate ion), and CGDD is collected. 
XVII. Identification of Molecules Which Interact with CGDD 

CGDD, or biologically active fragments thereof, are labeled with '-^I Bolton-Hunter reagent 
(Bolton, A.E. and W.M. Hunter (1973) Biochem. J. 133:529-539). Candidate molecules previously 
arrayed in the wells of a multi-well plate are incubated with the labeled CGDD, washed, and any 
wells with labeled CGDD complex are assayed. Data obtained using different concentrations of 
CGDD are used to calculate values for the number, affinity, and association of CGDD with the 
candidate molecules. 

Alternatively, molecules interacting with CGDD are analyzed using the yeast two-hybrid 
system as described in Fields, S. and O. Song (1989; Nature 340:245-246), or using commercially 
available kits based on the two-hybrid system, such as the MATCHMAKER system (Clontech). 

CGDD may also be used m the PATHCALLING process (CuraGen Corp., New Haven CT) 
which employs the yeast two-hybrid system in a high-throughput manner to determine all interactions 
between the proteins encoded by two large Ubraries of genes (Nandabalan, K. et al. (2000) U.S. 
Patent No. 6,057,101). 
XVni. Demonstration of CGDD Activity 

CGDD activity is demonstrated by measuring the induction of terminal differentiation or cell 
cycle progression when CGDD is expressed at physiologicaUy elevated levels in mammalian cell 
culture systems. cDNA is subcloned into a mammalian expression vector containmg a strong 
promoter that drives high levels of cDNA expression. Vectors of choice include PCMV SPORT (Life 
Technologies, Gaithersburg, MD) and PGR 3, 1 (Invitrogen, Carlsbad, CA), both of which contain the 
cytomegalovirus promoter. 5-10 tig of recombinant vector are transiently transfected into a human 
cell Ime, preferably of endothelial or hematopoietic origin, using either liposome formulations or 
electroporation. 1-2 Atg of an additional plasmid contaming sequences encoding a marker protein are 
co-transfected. Expression of a marker protein provides a means to distinguish transfected cells from 
npntransfected ceils and is a reliable predictor of cDNA expression from the recombinant vector. 
Marker protems of choice include, e.g.. Green Fluorescent Protein (GFP) (Clontech, Palo Alto, CA), 
CD64, or a CD64-GFP fusion protein. How cytometry detects and quantifies the uptake of 
fluorescent molecules that diagnose events preceding or coincident with cell cycle progression or 
terminal differentiation. These events include changes in nuclear DNA content as measured by 
staining of DNA with propidium iodide; changes in cell size and granularity as measured by forward 
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light scatter and 90 degree side light scatter; up or down-regulation of DNA synthesis as measured by 
decrease in bromodeoxyuridine uptake; alterations in expression of cell surface and intracellular 
proteins as measured by reactivity with specific antibodies; and alterations in plasma membrane 
composition as measured by the binding of fluorescein-conjugated Annexin V protein to the cell 
surface. Methods in flow cytometry are discussed in Ormerod, M.G. (1994) Flow Cytometry . Oxford, 
New York, NY. 

Alternatively, an in vitro assay for CGDD activity measures the transformation of normal 
human fibroblast cells overexpressing antisense CGDD RNA (Garkavtsev, I. and K. Riabowol (1997) 
Mol. Cell Biol. 17:2014-2019). cDNA encoding CGDD is subcloned into the pLNCX retroviral 
vector to enable expression of antisense CGDD RNA. The resulting construct is transfected into the 
ecotropic BOSC23 virus-packaging cell line. Virus contained in the BOSC23 culture supernatant is 
used to infect the amphotropic CAK8 virus-packaging cell line. Virus contained in the CAK8 culture 
supernatant is used to infect normal human fibroblast (Hs68) ceUs. Infected cells are assessed for the 
following quantifiable properties characteristic of transformed cells: growth in cultuie to high density 
associated with loss of contact inhibition, growth in suspension or in soft agar, formation of colonies 
or foci, lowered serum requirements, and ability to induce tumors when injected into immunodeficient 
mice. The activity of CGDD is proportional to the extent of transformation of Hs68 cells. 

Alternatively, CGDD can be expressed in a mammalian cell line by transforming the cells 
with a eukaryotic expression vector encoding CGDD. Eukaryotic expression vector are 
commercially available, and the techniques to introduce them into cells are well known to those 
skilled in the art. To assay the cellular localization of CGDD, cells are fractionated as described by 
Jiang, H.P. et al. (1992; Proc. Natl. Acad. Sci. 89:7856-7860). Briefly, cells pelleted by low-speed 
centrifiigation are resuspended in buffer (10 mM TRIS-HCl, pH 7.4/ 10 mM NaCU 3 mM MgCy 5 
mM EDTA with 10 ug/ml aprotinin, 10 ug/ml leupeptin, 10 ug/ml pepstatin A, 0.2 mM 
phenylmethylsulfonyl fluoride) and homogenized. The homogenate is centrifuged at 600 x g for 5 
minutes. The particulate and cytosol fractions are separated by ultracentrifugation of the supernatant 
at 100,000 X g for 60 minutes. The nuclear fraction is obtained by resuspending the 600 x g pellet in 
sucrose solution (0.25 M sucrose/ 10 mM TRIS-HCl, pH 7.4/ 2 mM MgCl^) and recentrifuged at 600 
X g. Equal amounts of protein from each fraction are applied to an SDS/10% polyacrylamide gel and 
blotted onto membranes. Western blot analysis is performed using CGDD anti-serum. The 
localization of CGDD is assessed by the intensity of the corresponding band in the nuclear fraction 
relative to the intensity in the other fractions. Alternatively, the presaoce of CGDD in cellular 
fractions is examined by fluorescence microscopy using a fluorescent antibody specific for CGDD. 

Alternatively, CGDD activity may be demonstrated as the ability to interact with its 
associated Ras superfamily protein, in an in vitro binding assay. The candidate Ras superfamily 
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proteins are expressed as fusion proteins with glutathione S-transferase (GST), and purified by 
affinity chromatography on glutathione-Sepharose. The Ras superfamily proteins are loaded with 
GDP by incubating 20 mM Tris buffer, pH 8.0, containing 100 mM NaCl, 2 mM EDTA, 5 mM 
MgC12, 0.2 mM DTT, 100 ^M AMP-PNP and 10 \M GDP at 30°C for 20 minutes. CGDD is 
5 expressed as a FLAG fusion protein in a baculovirus system. Extracts of these baculovirus cells 
containing CGDD-FLAG fusion proteins are precleared with GST beads, then incubated with GST- 
Ras superfamily fusion proteins. The con^lexes formed are precipitated by glutathione-Sepharose 
and separated by SDS-polyacrylamide gel electrophoresis. The separated proteins are blotted onto 
nitrocellulose membranes and probed with commercially available anti-FLAG antibodies. CGDD 

10 activity is proportional to the amount of CGDD-FLAG fusion protein detected in the conqjlex. 

Alternatively, as demonstrated by Li and Cohen (Li, L. and S.N. Cohen (1995) Cell 85:319- 
329), the ability of CGDD to suppress tumorigenesis can be measured by designing an antisense 
sequence to the 5' end of the gene and transfecting NIH 3T3 cells with a vector transcribing this 
sequence. The suppression of the endogenous gene will allow transformed fibroblasts to produce 

15 clumps of cells capable of forming metastatic tumors when introduced into nude mice. 

Alternatively, an assay for CGDD activity measures the effect of injected CGDD on the 
degradation of maternal transcripts. Procedures for oocyte collection from Swiss albino mice, 
injection, and culture are as described m Stutz et al., (supra). A decrease in the degradation of 
maternal RNAs as compared to control oocytes is indicative of CGDD activity. In the alternative, 

20 CGDD activity is measured as the ability of purified CGDD to bind to RNAse as measured by the 
assays described in Example XVII. 

Alternatively, an assay for CGDD activity measures syncytium formation in COS cells 
transfected with an CGDD expression plasmid, using the two-component fusion assay described in Mi 
(supra). This assay takes advantage of the fact that human iaterleukin 12 (IL-12) is a heterodimer 

25 comprising subunits with molecular weights of 35 kD (p35) and 40 kD (p40). COS cells transfected 
with expression plasmids carrying the gene for p35 are mixed with COS cells cotransfected with 
expression plasmids carrying the genes for p40 and CGDD. The level of IL-12 activity in the 
resultmg conditioned medium corresponds to the activity of CGDD in this assay. Syncytium 
formation may also be measured by light microscopy (Mi et al., supra). 

30 An altemative assay for CGDD activity measures cell proliferation as the amount of newly 

initiated DNA synthesis in Swiss mouse 3T3 cells. A plasmid containing polynucleotides encoding 
CGDD is transfected into quiescent 3T3 cultured cells using methods well known in the art. The 
transiently transfected cells are then incubated in the presence of pH] thymidine or a radioactive DNA 
precursor such as [a^^PJATP. Where applicable, varying amounts of CGDD ligand are added to the 

35 transfected cells. Incorporation of pH]thymidine into acid-precipitable DNA is measured over an 
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appropriate time interval, and the amount incorporated is directly proportional to the amount of newly 
synthesized DNA and CGDD activity. 

Alternatively, CGDD activity is measured by the cyclin-ubiquitin ligation assay (Townsley, 
F.M. et al. (1997) Proc. Natl. Acad. Sci. USA 94:2362-2367). The reaction contains in a volume of 
10 Ml, 40 mM Tris.HCl (pH 7.6), 5 mM Mg CU, 0.5 mM ATP, 10 mM phosphocreatine, 50 of 
creatine phosphokinase/ml, 1 mg reduced carboxymethylated bovine serum albumin/ml, 50 /jM 
ubiquitin, 1 ubiquitin aldehyde, 1-2 pmol "^I-labeled cyclm B, 1 pmol El, 1 /M okadaic acid, 10 
Mg of protein of M-phase fraction lA (containing active E3-C and essentially free of E2-C), and 
varying amounts of CGDD. The reaction is incubated at 18 °C for 60 minutes. Samples are then 
separated by electrophoresis on an SDS polyacrylamide gel. The amount of '^I- cyclin-ubiquitin 
formed is quantified by PHOSPHORMAGER analysis. The amount of cyclin-ubiquitin formation is 
proportional to the activity of CGDD in the reaction. 

Alternatively, an assay for CGDD activity uses radiolabeled nucleotides, such as [a^^P]ATP, 
to measure either the incorporation of radiolabel into DNA during DNA synthesis, or fragmentation 
of DNA that accompanies apoptosis. Mammalian cells are transfected with plasmid contaming cDNA 
encoding CGDD by methods well known in the art. Cells are then mcubated with radiolabeled 
nucleotide for various lengths of time. Chromosomal DNA is collected, and radioactivity is detected 
using a scintillation counter. Incorporation of radiolabel into chromosomal DNA is proportional to 
the degree of stimulation of the cell cycle. To determine if CGDD promotes apoptosis, chromosomal 
DNA is collected as above, and analyzed using polyacrylamide gel electrophoresis, by methods well 
known in the art. Fragmentation of DNA is quantified by comparison to imtransfected control cells, 
and is proportional to the apoptotic activity of CGDD. 

Altematively, cyclophilin activity of CGDD is measured using a chymotrypsm-coupled assay 
to measure the rate of cis to trans mterconversion (Fischer, G. et al. (1984) Biomed. Biochim. Acta 
43: 1 101-11 11). The chymotrypsin is used to estimate the trans-substrate cleavage activity at Xaa-Pro 
peptide bonds, wherein the rate constant for the cis to trans isomerization can be obtained by 
measuring the rate constant of the substrate hydrolysis at the slow phase. Samples are incubated in 
the presence or absence of the immunosuppressant drugs CsA or FK506, reactions initiated by 
addition of chymotrypsin, and the fluorescent reaction measured. The enzymatic rate constant is 
calculated from the equation = + k^, wherein first order kinetics are displayed, and where 
one unit of PPIase activity is defined as k^^ (s '). 

Altematively, cyclophilin activity of CGDD is monitored by a quantitative immunoassay that 
measures its affinity for stereospecific binding to the immunosuppressant drug cyclosporin 
(Quesniaux, V.F. et al. (1987) Eur. J. Immunol. 17:1359-1365). In this assay, the cyclophilin- 
cyclosporin complex is coated on a solid phase, with bindmg detected using anti-cyclophihn rabbit 
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antiserum enhanced by an antiglobulin-enzyme conjugate. 

Alternatively, activity of CGDD is monitored by a binding assay developed to measure the 
non-covalent binding between FKBPs and immunosuppressant drugs in the gas phase using 
electrospray ionization mass spectrometry (Trepanier, DJ. et al. (1999) Ther. DrugMonit. 21:274- 
280). In electrospray ionization, ions are generated by creating a fine spray of highly charged droplets 
in the presence of a strong electric field; as the droplet decreases in size, the charge density on the 
surface increases. Ions are electrostatically directed into a mass analyzer, where ions of opposite 
charge are generated in spatially separate sources and then swept into capillary inlets where the flows 
are merged and where reactions occur. By comparing the charge states of bound versus unbound 
CGDD/immunosuppressive drug complexes, relative binding affinities can be established and 
correlated with in vitro binding and immunosuppressive activity. 

Alternatively, a fluorescence monitoring assay for detecting activated Ras using RRP22 is as 
follows. The RRP22 bindmg domain (RRP22BD) of c-Rafl (a kinase activated during reentry into 
meiosis) is synthesized fi-om two unprotected peptide segments by native chemical Ugation. Two 
fluorescent amino acids with structures based on the nitrobenz-2-oxa-l,3-diazole and coumaryl 
chromophores are incorporated close to the RRP22BD/RRP22-GTP bmding surface followed by 
introduction of a C-terminal tag consisting of His(6). The values for bmding of the 
site-specifically modified proteins to Ras-GTP are compared to that of wild-type RED. Ras-GTP is 
detected within the 100 nM range by immobilization of C-terminal His(6) tag-modified fluorescent 
RBD onto Ni-NTA-coated surfaces. Ras-GDP does not bind to the immobilized RBD, thus allowing 
discrimination between inactive and activated Ras (Becker, C. F. (2001) Chem. Biol. 8:243-252). 

Various modifications and variations of the described compositions, methods, and systems of 
the invention will be apparent to those skilled in the art without departing fi-om the scope and spirit of 
the invention. It will be appreciated that the invention provides novel and usefiil proteins, and their 
encoding polynucleotides, which can be used in the drug discovery process, as well as methods for 
using these compositions for the detection, diagnosis, and treatinent of diseases and conditions. 
Although the invention has been described m connection with certain embodiments, it should be 
understood that the invention as claimed should not be unduly limited to such specific embodiments. 
Nor should the description of such embodunents be considered exhaustive or limit the invention to 
the precise forms disclosed. Furthermore, elements firom one embodiment can be readily recombined 
with elements from one or more other embodiments. Such combinations can form a number of 
embodiments within the scope of the invention. It is intended that the scope of the invention be 
defined by the following claims and their equivalents. 
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1. An isolated polypeptide selected from the group consisting of: 

a) a polypeptide comprising an amino acid sequence selected from the group consisting 
of SEQ ID NO: 1-33, 

b) a polypeptide comprising a naturally occurring amino acid sequence at least 90% 
identical to an amino acid sequence selected from the group consisting of SEQ ID 
NO:l, SEQ ID N0:3, SEQ ID NO:6-7, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID 
NO: 15, SEQ ED NO: 17. SEQ ID NO:20, SEQ ID NO:22, SEQ ID NO:25, SEQ ID 
NO:28-29. and SEQ ID NO:32-33, 

c) a polypeptide con^rising a naturally occurring amino acid sequence at least 98% 
identical to an amino acid sequence selected from the group consisting of SEQ ID 
NO:4andSEQIDNO:21, 

d) a polypeptide comprising a naturally occurring amino acid sequence at least 97% 
identical to an amino acid sequence selected from the group consisting of SEQ ID 
NO: 18 and SEQ ID NO:23-24, 

e) a polypeptide consisting essentially of a naturally occurring amino acid sequence at 
least 90% identical to an amino acid sequence selected from the group consisting of 
SEQ ID NO:2, SEQ ID N0:5, SEQ ED N0:9, SEQ ED NO: 13-14, SEQ ED NO: 19, 
SEQ ID NO:26-27 and SEQ ED NO:30-31, 

f) a biologically active fragment of a polypeptide having an amino acid sequence 
selected from the group consisting of SEQ ED NO: 1-33, and 

g) an immunogenic fragment of a polypeptide having an amino acid sequence selected 
from the group consisting of SEQ ED NO: 1-33. 

2. An isolated polypeptide of claim 1 comprising an amino acid sequence selected from the 
group consisting of SEQ ID NO: 1-33. 

3. An isolated polynucleotide encoding a polypeptide of claim 1. 

4. An isolated polynucleotide encoding a polypeptide of claim 2. 

5. An isolated polynucleotide of claim 4 comprising a polynucleotide sequence selected from 
the group consisting of SEQ ID NO:34-66. 
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6. A recombiBant polynucleotide comprising a promoter sequence operably linked to a 
polynucleotide of claim 3. 

7. A cell transformed with a recombinant polynucleotide of claim 6. 

8. A transgenic organism comprising a recombinant polynucleotide of claim 6. 

9. A method of producing a polypeptide of claim 1, the method comprising- 

a) culturing a cell under conditions suitable for expression of the polypeptide, wherein 
said cell is transformed with a recombinant polynucleotide, and said i^combinant 
polynucleotide comprises a promoter sequence operably Unked to a polynucleotide 
encodmg the polypeptide of claim 1, and 

b) recovering the polypeptide so expressed. 

10. A method of claim 9, wherein the polypeptide comprises an ammo acid sequence selected 
irom the group consisting of SEQ ID NO: 1-33. 

11. An isolated antibody which specifically binds to a polypeptide of claim 1. 

12. An isolated polynucleotide selected from the group consisting of: 

a) a polynucleotide comprising a polynucleotide sequence selected from the group 
consisting of SEQ JD NO:34-66, 

b) a polynucleotide comprismg a naturally occurring polynucleotide sequence at least 
90% Identical to a polynucleotide sequence selected from the group consistmg of 
SEQ ID NO:34-43. SEQ ID NO:45-50, SEQ TD NO.Sl, SEQ ID NO:55. SEQ ID 
NO:58-60. SEQ ID NO:62. and SEQ ID NO;65-66, 

0 a polynucleotide comprising a naturally oc^uiring polynucleotide sequence at least 
95% Identical to the polynucleotide sequence of SEQ ID N0:51 

d) a polynucleotide con^rising a naturally occumng polynucleotidl sequence at least 
98% Identical to the polynucleotide sequence of SEQ ID NO:53 

e) a polynucleotide comprising a naturally occmring polynucleotide sequence at least 
96% identical to a polynucleotide sequence selected from the group consisting of 
SEQ ID NO:54 and SEQ ID 56-57, 

f) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 
93% Identical to the polynucleotide sequence of SEQ ID N0:61. 
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g) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 
99% identical to the polynucleotide sequence of SEQ ID NO:63, 

h) a polynucleotide consisting essentially of a naturally occurring polynucleotide 
sequence at least 90% identical to the polynucleotide sequence consisting of SEQ ID 
NO:64, 

i) a polynucleotide complementary to a polynucleotide of a), 
j) a polynucleotide complementary to a polynucleotide of b), 
k) a poljnnucleotide complementary to a polynucleotide of c), 
1) a polynucleotide complementary to a polynucleotide of d), 
m) a polynucleotide complementary to a polynucleotide of e), 
n) a polynucleotide complementary to a polynucleotide of f), 
o) a polynucleotide con^lementary to a polynucleotide of g), 

p) a polynucleotide complementary to a polynucleotide of h), and 
q) an RNA equivalent of a)-p). 

13. An isolated polynucleotide comprising at least 60 contiguous nucleotides of a 
polynucleotide of claim 12. 

14. A method of detecting a target polynucleotide in a sample, said target polynucleotide 
having a sequence of a polynucleotide of claim 12, the method conqmsing: 

a) hybridizing the sample with a probe comprising at least 20 contiguous nucleotides 
comprising a sequence complementary to said target polynucleotide in the sanq)le, 
and which probe specifically hybridizes to said target polynucleotide, under 
conditions whereby a hybridization complex is formed between said probe and said 
target polynucleotide or fragments thereof, and 

b) detecting the presence or absence of said hybridization complex, and, optionally, if 
present, the amount thereof. 

15. A method of claim 14, wherein the probe comprises at least 60 contiguous nucleotides. 

16. A method of detecting a target polynucleotide in a sample, said target polynucleotide 
having a sequence of a polynucleotide of claim 12, the method comprising: 

a) amplifymg said target polynucleotide or firagment thereof using polymerase cham 
reaction amplification, and 

b) detecting the presence or absence of said amplified target polynucleotide or fragment 
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17. A composition comprismg a polypeptide of claim 1 and a pharmaceutically acceptable 
excipient. 



18. A composition of claim 17, wherein the polypeptide comprises an amino acid sequence 
selected from the group consisting of SEQ E) NO: 1-33. 

19. A method for treating a disease or condition associated with decreased expression of 

10 functional CGDD. comprising administering to a patient in need of such treatment the composition of 
claim 17. 



20. A method of screening a compound for effectiveness as an agonist of a polypeptide of 
claim 1, the method conq)rising: 

15 a) exposing a sample comprising a polypeptide of claim 1 to a compound, and 

b) detecting agonist activity m the sample. 

21. A composition comprising an agonist compound identified by a method of claim 20 and a 
pharmaceutically acceptable excipient. 

20 

22. A method for treatmg a disease or condition associated with decreased expression of 
functional CGDD. comprising administering to a patient in need of such treatment a composition of 
claim 21. 



25 23. Amethodofscreeningacon^oundforeffectivenessasanantagonistofapolypeptideof 

claim 1, the method comprising: 

a) exposing a sample con5)rising a polypeptide of claim 1 to a compound, and 

b) detecting antagonist activity in the sample. 

30 24. A composition comprising an antagonist compomid identified by a method of claim 23 

and a pharmaceutically acceptable excipient. 

25. A method for treating a disease or condition associated with overexpression of functional 
CGDD, comprising administering to a patient in need of such treatment a composition of claim 24 

35 
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26. A method of screening for a compound that specifically binds to the polypeptide of claim 
1, the method comprising: 

a) combining the polypeptide of claim 1 with at least one test compound under suitable 
conditions, and 

b) detecting binding of the polypeptide of claim 1 to the test compound, thereby 
identifying a cornpound that specifically binds to the polypeptide of claim 1. 

27. A method of screening for a compound that modulates the activity of the polypeptide of 
claim 1, the method comprising: 

a) combining the polypeptide of claim 1 with at least one test compound under 
conditions permissive for the activity of the polypeptide of claim 1, 

b) assessing the activity of the polypeptide of claim 1 in the presence of the test 
compound, and 

c) con^aring the activity of the polypeptide of claim 1 in the presence of the test 
compound with the activity of the polypeptide of claim 1 in the absence of the test 
compound, wherein a change in the activity of the polypeptide of claim 1 in the 
presence of the test compound is indicative of a cocqpound that modulates the activity 
of the polypeptide of claim 1 . 

28. A method of screening a compound for effectiveness in altering expression of a target 
polynucleotide, wherein said target polynucleotide comprises a sequence of claim 5, the method 
comprising: 

a) exposing a sample conq)rising the target polynucleotide to a compound, under 
conditions suitable for the expression of die target polynucleotide, 

b) detecting altered expression of the target polynucleotide, and 

c) comparing the expression of the target polynucleotide in the presence of varying 
amounts of the compound and in the absence of the compoimd. 

29. A method of assessing toxicity of a test conopound, the method comprising: 

a) treating a biological sample containing nucleic acids with the test compound, 

b) hybridizing the nucleic acids of the treated biological sample with a probe comprising 
at least 20 contiguous nucleotides of a polynucleotide of claim 12 under conditions 
whereby a specific hybridization complex is formed between said probe and a target 
polynucleotide in the biological sample, said target polynucleotide comprising a 
polynucleotide sequence of a polynucleotide of claim 12 or fragment thereof. 
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c) quantifying the amount of hybridization complex, and 

d) comparing the amount of hybridization complex in the treated biological sample with 
the amount of hybridization complex in an untreated biological sample, wherein a 
difference in the amount of hybridization complex in the treated biological sample is 
indicative of toxicity of the test compound. 

30. A method for a diagnostic test for a condition or disease associated with the expression of 
CGDD in a biological sample, the method conq)rising: 

a) combining the biological san^le with an antibody of claim 11, under conditions 
suitable for the antibody to bind the polypeptide and form an antibody:polypeptide 

complex, and 

b) detecting the complex, wherein the presence of the complex correlates with the 
presence of the polypeptide in the biological sample. 

31. The antibody of claim 11, wherein the antibody is: 

a) a chimeric antibody, 

b) a single chain antibody, 

c) a Fab fragment, 

d) a F(ab')2 fragment, or 

e) a humanized antibody. 

32. A composition coiiq)rising an antibody of claim 11 and an acceptable excipient. 

33. A method of diagnosing a condition or disease associated with the expression of CGDD 
in a subject, comprismg administering to said subject an effective amount of the composition of claim 
32. 

34. A composition of claim 32, further comprising a label. 

35. A method of diagnosing a condition or disease associated with the expression of CGDD 
in a subject, conq)rising administering to said subject an effective amount of the composition of claim 
34. 

36. A method of preparing a polyclonal antibody with the specificity of the antibody of claim 
11, the method conqprising: 
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a) immunizrag an animal with a polypeptide consisting of an amino acid sequence 
selected from the group consistmg of SEQ ID NO: 1-33, or an immunogenic fragment 
thereof, under conditions to elicit an antibody response, 

b) isolating antibodies from the animal, and 

c) screening the isolated antibodies with the polypeptide, thereby identifying a 
polyclonal antibody which specifically binds to a polypeptide comprising an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-33. 

37. A polyclonal antibody produced by a method of claim 36. 

38. A composition comprising the polyclonal antibody of claim 37 and a suitable earner. 

39. A method of making a monoclonal antibody with the specificity of the antibody of claim 
11, the method comprising: 

a) immunizing an animal with a polypeptide consisting of an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-33, or an immunogenic fragment 
thereof, under conditions to elicit an antibody response, 

b) isolating antibody producing cells from the animal, 

c) fusing the antibody producing cells with immortalized cells to form monoclonal 
antibody-producing hybridoma cells, 

d) culturing the hybridoma cells, and 

e) isolating from the culture monoclonal antibody which specifically binds to a 
polypeptide comprising an amino acid sequence selected from the group consisting of 
SEQIDNO;l-33. 

40. A monoclonal antibody produced by a method of claim 39. 

41. A composition comprising the monoclonal antibody of claim 40 and a suitable carrier. 

42. The antibody of claim 11, wherein the antibody is produced by screening a Fab 
expression library. 

43. The antibody of claim 11, wherein the antibody is produced by screening a recombinant 
immunoglobulin library. 
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44. A method of detecting a polypeptide comprising an amino acid sequence selected from 
the group consisting of SEQ ID NO: 1-33 in a sample, the method comprising: 

a) incubating the antibody of claim 1 1 with the sample under conditions to allow 
specific binding of the antibody and the polypeptide, and 
5 b) detecting specific binding, wherein specific binding indicates the presence of a 

polypeptide comprising an ammo acid sequence selected from the group consisting of 
SEQ ID NO: 1-33 m the sample. 

45. A method of purifying a polypeptide comprising an amino acid sequence selected from 
10 the group consisting ofSEQ ID NO:l-33 from a sample, the method comprising: 

a) incubating the antibody of claim 1 1 with the sample under conditions to allow 
specific binding of the antibody and the polypeptide, and 

b) separating the antibody from the sample and obtaining the purified polypeptide 
conq)rising an ammo acid sequence selected from the group consisting of SEQ ID 

15 NO: 1-33. 



20 



13. 



46. A microarray wherein at least one element of the microaxray is a polynucleotide of claim 



47. A method of generating an expression profile of a sample which contains 
polynucleotides, the method comprising: 

a) labeling the polynucleotides of the sample, 

b) contacting the elements of the microanay of claun 46 with the labeled 
polynucleotides of the sample under conditions suitable for the formation of a 

25 hybridization complex, and 

c) quantifying the expression of the polynucleotides in the sample. 

48. An array comprising different nucleotide molecules affixed in distinct physical locations 
on a solid substrate, wherein at least one of said nucleotide molecules comprises a first 

30 oUgonucleotide or polynucleotide sequence specifically hybridizable with at least 30 contiguous 
nucleotides of a target polynucleotide, and wherein said target polynucleotide is a polynucleotide of 
claim 12. 



49. An array of claim 48. wherein said first oligonucleotide or polynucleotide sequence 
35 completely complementary to at least 30 contiguous nucleotides of said target polynucleotide. 



is 
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50. An aixay of claim 48, wherein said furst oligonucleotide or polynucleotide sequence is 
completely complementary to at least 60 contiguous nucleotides of said target polynucleotide. 

51. An array of claim 48, wherein said &st oligonucleotide or polynucleotide sequence is 
completely complementary to said target polynucleotide. 

52. An array of claim 48, which is a microarray. 

53. An array of claim 48, further comprising said target polynucleotide hybridized to a 
nucleotide molecule comprising said first oligonucleotide or polynucleotide sequence. 

54. An array of claim 48, wherein a linker joins at least one of said nucleotide molecules to 
said solid substrate. 

55. An array of claim 48, wherein each distinct physical location on the substrate contains 
multiple nucleotide molecules, and the multiple nucleotide molecules at any single distinct physical 
location have the same sequence, and each distinct physical location on the substrate contains 
nucleotide molecules havmg a sequence which differs from the sequence of nucleotide molecules at 
another distmct physical location on the substrate. 

56. A polypeptide of claim 1 , comprising the amino acid sequence of SEQ ID NO: 1 . 

57. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:2. 

58. A polypeptide of claim 1, con5»rising the amino acid sequence of SEQ ID N0:3. 

59. A polypeptide of claim 1, conqjrising the amino acid sequence of SEQ ID N0:4. 

60. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID N0:5. 

61. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID N0:6. 

62. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:7. 

63. A polypeptide of claim 1, comprising the ammo acid sequence of SEQ ID N0:8. 
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64. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID N0:9. 

65. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO. IO. 

66. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 11. 

67. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 12. 

68. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 13. 

69. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 14. 

70. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 15. 

71. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 16. 

72. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 17. 

73. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 18. 

74. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 19. 

75. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:20. 

76. A polypeptide of claim 1, conq)rising the anaino acid sequence of SEQ ID N0:21. 

77. A polypeptide of claim 1, con^rising the amino acid sequence of SEQ ID NO:22. 

78. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:23. 

79. A polypeptide of claim 1, comprising the amino acid sequence of SEQ JD NO:24. 

80. A polypeptide of claim 1, conq)rising the anoino acid sequence of SEQ ID NO:25. 

81. A polypeptide of claim 1, con^rising the amino acid sequence of SEQ ID NO:26. 
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82. A polypeptide of claim 1. comprising the amino acid sequence of SEQ ID NO:27. 

83. A polypeptide of claim 1 , comprising the amino acid sequence of SEQ ID NO:28. 

84. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:29. 

85. A polypeptide of claim I, comprising the amino acid sequence of SEQ ID NO:30. 

86. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID N0:31. 

87. A polypeptide of claim 1, coDoprising the amino acid sequence of SEQ ID NO:32. 

88. A polypeptide of claim 1. comprising the amino acid sequence of SEQ ID NO:33. 

89. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 



NO:34. 



90. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:35. 

20 

91. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:36. 



92. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

25 NO:37. 



93. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:38. 



30 



35 



94. A polynucleotide of claim 12, con^rising the polynucleotide sequence of SEQ ID 

NO:39. 



95. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:40. 
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96. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

N0:41. 



97. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

5 NO:42. 



98. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:43. 



10 99. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:44. 

100. A polynucleotide of claim 12, comprismg the polynucleotide sequence of SEQ ID 

NO:45. 

15 

101. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:46. 



102. A polynucleotide of claun 12, comprismg the polynucleotide sequence of SEQ ID 

20 NO:47. 



103. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:48. 



25 104. A polynucleotide of claim 12, comprismg the polynucleotide sequence of SEQ ID 

NO:49. 

105. A polynucleotide of claim 12, con^jrising the polynucleotide sequence of SEQ ID 

N0:50. 

30 

106. A polynucleotide of claun 12, comprising the polynucleotide sequence of SEQ ID 

N0:51. 



107. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

35 NO:52. 
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108. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:53. 

109. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

5 NO:54. 



1 10. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:55. 



11 1 . A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:56. 



1 12. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:57. 

15 

1 13. A polynucleotide of claim 12, conqjrising the polynucleotide sequence of SEQ ID 

NO:58. 



1 14. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

20 NO:59. 



1 15. A polynucleotide of claim 12, conopising the polynucleotide sequence of SEQ ID 

NO:60. 



25 1 16. A polynucleotide of claim 12, conq)rising the polynucleotide sequence of SEQ ID 

N0:61. 

117. A polynucleotide of claim 12, comprising tiie polynucleotide sequence of SEQ ID 

NO:62. 

30 

1 18. A polynucleotide of claim 12, conq)rising tiie polynucleotide sequence of SEQ ID 

NO:63. 



1 19. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

35 NO:64. 
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120. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:65. 



121. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

5 NO:66. 
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<110> INCYTE GENOMICS, INC. 
GRIFFIN, Jennifer A 
RAMKUMAR, Jayalaxmi 
EMERLING, Brooke M. 
KABLE, Amy E. 
ELLIOTT, Vicki S. 
MARQUIS, Joseph P. 
BAUGHN, Mariah R. 
GORVAD, Ann E. 
YUE, Henry 
LEE, Ernestine A. 
BECHA, Shanya D. 
TANG, Y. Tom 
TRAN, Uyen K. 
SWARNAKAR, Anita 
LEE, Sally 
ISON, Craig H. 
HAFALIA, April J. A. 
TRAN, Bao 

SPRAGUE, William W. 
LEE, Soo Yeun 
KHARE, Reena 
GANDHI, Ameena R. 
GIETZEN, Kimberly J. 
BHATIA, Umesh 
BURRXLL, John D. 
BLAKE, Julie J. 
HO, Anne 
Zheng, Wen j in 

<120> PROTEINS ASSOCIATED WITH CELL 
<130> PF-1319 PCT 



GROWTH, DIFFERENTIATION, AND DEATH 



<140> To Be Assigned 
<141> Herewith 

<150> US 60/340,747 
<151> 2001-12-07 

<150> US 60/342,761 
<151> 2001-12-20 

<150> US 60/349,705 
<151> 2002-01-15 

<150> US 60/354,764 
<151> 2002-02-06 

<150> US 60/356,216 
<151> 2002-02-12 

<160> 66 

<170> PERL Program 

<210> 1 

<211> 1104 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7722608CD1 



1/89 



wo 03/050253 „^^„.^ 

PCT/US02/39133 



<400> 1 

Met Gly Asn Ala Ala Gly Ser Ala Glu Gin Pro Ala Gly Pro Ala 
^ 10 IS 

Ala Pro Pro Pro Lys Gin Pro Ala Pro Pro Lys Gin Pro Met Pro 

Ala Ala Gly Glu Leu Glu Ala Arg Phe Asn Arg Ala Leu Asn Cys 

35 4Q 
Met Asn Leu Pro Pro Asp Lys Val Gin Leu Leu Ser Gin Tyr Asp 

Asn Glu Lys Lys Trp Glu Leu He Cys Asp Gin Glu Arg Phe Gin 

65 70 
Val Lys Asn Pro Pro Ala Ala Tyr He Gin Lys Leu Lys Ser Tyr 

80 Q5 
val Asp Thr Gly Gly Val Ser Arg Lys Val Ala Ala Asp Trp Met 

Ser Asn Leu Gly Phe Lys Arg Arg Val Gin Glu Ser Thr Gin Val 
HO 

Leu Arg Glu Leu Glu Thr Ser Leu Arg Thr Asn His He Gly Trp 

130 13C 

Val Gin Glu Phe Leu Asn Glu Glu Asn Arg Gly Leu Asp Val Leu 

145 isn 

Leu Glu Tyr Leu Ala Phe Ala Gin Cys Ser Val Thr Tyr Asp Met 

160 Ifiq 

Glu Ser Thr Asp Asn Gly Ala Ser Asn Ser Glu Lys Asn Lys Pro 

175 Tan 

Leu Glu Gin Ser Val Glu Asp Leu Ser Lys Gly Pro Pro Ser Ser 
185 -Qj. 
val Pro Lys Ser Arg His Leu Thr He Lys Leu Thr Pro Ala His, 

205 21 n 

Ser Arg Lys Ala Leu Arg Asn Ser Arg He Val Ser Gin Lys Asp 
215 220 ooc 

Asp Val His val Cys He Met Cys Leu Arg Ala He Met Asn Tyr 

235 o^n 
Gin Ser Gly Phe Ser Leu Val Met Asn His Pro Ala Cys Val Asn 
245 250 occ 

Glu He Ala Leu Ser Leu Asn Asn Lys Asn Pro Arg Thr Lys Ala 
260 265 o7n 

Leu Val Leu Glu Leu Leu Ala Ala Val Cys Leu Val Arg Gly Gly 

280 oQc 
His Asp He He Leu Ala Ala Phe Asp Asn Phe Lys Glu Val Cys 

295 -inn 
Gly Glu Gin His Arg Phe Glu Lys Leu Met Glu Tyr Phe Arg Asn 

ni 310 315 

Glu Asp Ser Asn He Asp Phe Met Val Ala Cys Met Gin Phe He 

■^20 325 
Asn He Val Val His Ser Val Glu Asn Met Asn Phe Arg Val Phe 

340 

Leu Gin Tyr Glu Phe Thr His Leu Gly Leu Asp Leu Tyr Leu Glu 

Arg Leu Arg Leu Thr Glu Ser Asp Lys Leu Gin Val Gin He Gin 
365 370 T-c- 

Ala Tyr Leu Asp Asn He Phe Asp Val Gly Ala Leu Leu Glu Asp 

Thr Glu Thr Lys Asn Ala Val Leu Glu III Met Glu Glu Leu Gin 
395 400 /ine: 

Glu Gin val Ala Leu Leu Thr Glu Arg Leu Arg Asp Ala Glu Asn 
1" 415 Aon 

Glu ser Met Ala Lys He Ala Glu Leu Glu Lys Gin Leu Ser Gin 
425 430 ^-ii- 

Ala Arg Lys Glu Leu Glu Thr Leu Arg Glu Arg Phe Ser Glu Ser 
440 

Thr Ala Met Gly Ala Ser Arg Arg Pro Pro Glu Pro Glu Lys Ala 
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460 /gc 
Pro Pro Ala Ala Pro Thr Arg Pro Ser Ala Leu Glu Leu Lys Val 

475 480 
Glu Glu Leu Glu Glu Lys Gly Leu He Arg He Leu Arg Gly Pro 

, , 490 
Gly Asp Ala Val Ser He Glu He Leu Pro Val Ala Val Ala Thr 

500 505 
Pro Ser Gly Gly Asp Ala Pro Thr Pro Gly Val Pro Thr Gly Ser 
^■^^ 520 525 

Pro Ser Pro Asp Leu Ala Pro Ala Ala Glu Pro Ala Pro Gly Ala 

Ala Pro Pro Pro Pro Pro Pro Leu Pro Gly Leu Pro Ser Pro Gin 
545 55Q |-|-_ 

Glu Ala Pro Pro Ser Ala Pro Pro Gin Ala Pro Pro Leu Pro Gly 
560 565 r-yQ 

Ser Pro Glu Pro Pro Pro Ala Pro Pro Leu Pro Gly Asp Leu Pro 
^'^ 580 585 

Pro Pro Pro Pro Pro Pro Pro Pro Pro Pro Gly Thr Asp Gly Pro 
550 595 g.- 

Val Pro Pro Pro Pro Pro Pro Pro Pro Pro Pro Pro Gly Gly Pro 

610 61-5 
Pro Asp Ala Leu Gly Arg Arg Asp Ser Glu Leu Gly Pro Gly Val 

620 625 
Lys Ala Lys Lys Pro He Gin Thr Lys Phe Arg Met Pro Leu Leu 
6-^5 640 645 

Asn Trp Val Ala Leu Lys Pro Ser Gin He Thr Gly Thr Val Phe 
650 655 fifin 

Thr Glu Leu Asn Asp Glu Lys Val Leu Gin Glu Leu Asp Met Ser 
665 670 fiTc 

Asp Phe Glu Glu Gin Phe Lys Thr Lys Ser Gin Gly Pro Ser Leu 

S85 690 
Asp Leu Ser Ala Leu Lys Ser Lys Ala Ala Gin Lys Ala Pro Ser 

700 701; 
Lys Ala Thr Leu He Glu Ala Asn Arg Ala Lys Asn Leu Ala He 
mv r '■'•^ 715 720 

Thr Leu Arg Lys Gly Asn Leu Gly Ala Glu Arg He Cys Gin Ala 
^25 

He Glu Ala Tyr Asp Leu Gin Ala Leu Gly Leu Asp Phe Leu Glu 

745 

Leu Leu Met Arg Phe Leu Pro Thr Glu Tyr Glu Arg Ser Leu He 

"^55 760 
Thr Arg Phe Glu Arg Glu Gin Arg Pro Met Glu Glu Leu Ser Glu 

'''^0 775 
Glu Asp Arg Phe Met Leu Cys Phe Ser Arg He Pro Arg Leu Pro 

785 790 rtqc 

Glu Arg Met Thr Thr Leu Thr Phe Leu Gly Asn Phe Pro Asp Thr 

Ala Gin Leu Leu Met Pro Gin Leu Asn Ala He He Ala Ala ler 
^15 820 fioc 

Met Ser He Lys Ser Ser Asp Lys Leu Arg Gin He Leu Glu He 

val Leu Ala Phe Gly Asn Tyr Met Asn Ser Ser Lys Arg Gly 2a 

Ala Tyr Gly Phe Arg Leu Gin Ser Leu Isp Ala Leu Leu Glu Met 

855 870 
Lys Ser Thr Asp Arg Lys Gin Thr Leu Leu His Tyr Leu Val Lys 

^'^^ 880 QQi; 

val He Ala Glu Lys Tyr Pro Gin Leu Thr Gly Phe His Ser Asp 
T 895 900 

Leu His Phe Leu Asp Lys Ala Gly Ser Val Ser Leu Asp Ser Val 

910 915" 
Leu Ala Asp Val Arg Ser Leu Gin Arg Gly Leu Glu Leu Thr Gin 

920 925 930 
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Arg Glu Phe Val Arg Gin Asp Asp Cys Met Val Leu Lys Glu Phe 

940 945 

Leu Arg Ala Asn Ser Pro Thr Met Asp Lys Leu Leu Ala Asp Ser 
950 955 

Lys Thr Ala Gin Glu Ala Phe Glu Ser Val Val Glu Tyr Phe Gly 

970 

Glu Asn Pro Lys Thx Thr Ser Pro Gly Leu Phe Phe Ser Leu Phe 
c , 985 990 

Ser Arg Phe lie Lys Ala Tyr Lys Lys Ala Glu Gin Glu Val Glu 
on m 1000 1005 

Gin Trp Lys Lys Glu Ala Ala Ala Gin Glu Ala Gly Ala Asp Thr 

r, 1015 1020 

Pro Gly Lys Gly Glu Pro Pro Ala Pro Lys Ser Pro Pro Lys Ala 

■^^^^ 1030 1035 

Arg Arg Pro Gin Met Asp Leu He Ser Glu Leu Lys Arg Arg Gin 

1040 1045 1050 

Gin Lys Glu Pro Leu lie Tyr Glu Ser Asp Arg Asp Gly Ala He 
^1 1055 1060 -LQgg 

Glu Asp lie He Thr Asp Leu Arg Asn Gin Pro Tyr He Arg Ala 

■'-°'^0 1075 1080 

Asp Thr Gly Arg Arg Ser Ala Arg Arg Arg Pro Pro Gly Pro Pro 

I , ^^^^ 1090 1095 

Leu Gin Val Thr Ser Asp Leu Ser Leu 
1100 

<210> 2 

<211> 1299 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7505869CD1 

<400> 2 

Met Lys Val Phe Ser Leu He Gin Gin Ala Val Thr Arg Tyr Arg 
^5 10 



Lys Ala He Ala Lys Asp Pro Asn Tyr T^^ He Gin Val His A^g 

20 25 
Leu Glu His Gly Asp Gly Gly He Leu Asp Leu Asp Asp He Leu 
_ 35 40 «5 

Cys Asp Val Ala Asp Asp Lys Asp Arg Leu Val Ala Val Phe Asp 

50 55 
Glu Gin Asp Pro His His Gly Gly Asp Gly Thr Ser Ala Ser Ser 
€5 70 7c 

Thr Gly Thr Gin Ser Pro Glu He Phe Gly Ser Glu Leu Gly Th? 

Asn Asn Val Ser Ala Phe Gin Pro Tyr Gin Ala Thr Ser Glu lie 

Glu Val Thr Pro Ser Val Leu Arg Ala l^n Met Pro Leu His Val 
110 115 -i^f, 

Arg Arg Ser Ser Asp Pro Ala Leu He Gly Leu Ser Thr Ser Val 
125 130 

Ser Asp ser Asn Phe Ser Ser Glu Glu Pro Ser Arg Lys Asn Pro 

140 145 , rn 

Thr Arg Trp Ser Thr Thr Ala Gly Phe Leu Lys Gin Asn Thr Ala 
m 160 165 

Gly Ser Pro Lys Thr Cys Asp Arg Lys Lys Asp Glu Asn Tyr Arg 

175 180 
Ser Leu Pro Arg Asp Thr Ser Asn Trp Ser Asn Gin Phe Gin Arg 

a . , 190 195 

Asp Asn Ala Arg Ser Ser Leu Ser Ala Ser His Pro Met Val Gly 
200 205 210 
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Lys Trp Leu Glu Lys Gin Glu Gin Asp Glu Asp Gly Thr Glu Glu 

215 220 225 

Asp Asn Ser Arg Val Glu Pro Val Gly His Ala Asp Thr Gly Leu 

230 235 240 

Glu His lie Pro Asn Phe Ser Leu Asp Asp Met Val Lys Leu Val 

245 250 255 

Glu Val Pro Asn Asp Gly Gly Pro Leu Gly He His Val Val Pro 

260 265 270 

Phe Ser Ala Arg Gly Gly Arg Thr Leu Gly Leu Leu Val Lys Arg 

275 280 285 

Leu Glu Lys Gly Gly Lys Ala Glu His Glu Asn Leu Phe Arg Glu 

290 295 300 

Asn Asp Cys He Val Arg He Asn Asp Gly Asp Leu Arg Asn Arg 

305 310 3-^5 

Arg Phe Glu Gin Ala Gin His Met Phe Arg Gin Ala Met Arg Thr 

320 325 330 

Pro He He Trp Phe His Val Val Pro Ala Ala Asn Lys Glu Gin 

335 340 
Tyr Glu Gin Leu Ser Gin Ser Glu Lys Asn Asn Tyr Tyr Ser Ser 

355 360 
Arg Phe Ser Pro Asp Ser Gin Tyr He Asp Asn Arg Ser Val Asn 

365 370 
Ser Ala Gly Leu His Thr Val Gin Arg Ala Pro Arg Leu Asn His 

385 390 
Pro Pro Glu Gin He Asp Ser His Ser Arg Leu Pro His Ser Ala 
„. 395 400 

His Pro Ser Gly Lys Pro Pro Ser Ala Pro Ala Ser Ala Pro Gin 
^■^^ 415 420 

Asn val Phe Ser Thr Thr Val Ser Ser Gly Tyr Asn Thr Lys Lys 
T-i 430 435 

He Gly Lys Arg Leu Asn He Gin Leu Lys Lys Gly Thr Glu Gly 

445 450 
Leu Gly Phe Ser He Thr Ser Arg Asp Val Thr He Gly Gly Ser 
„ , 460 465 

Ala Pro He Tyx Val Lys Asn He Leu Pro Arg Gly Ala Ala He 

470 475 
Gin Asp Gly Arg Leu Lys Ala Gly Asp Arg Leu He Glu Val Asn 
485 490 4gc 

Gly Val Asp Leu Val Gly Lys Ser Gin Glu Glu Val Val Ser Leu 

505 510 
Leu Arg Ser Thr Lys Met Glu Gly Thr Val Ser Leu Leu Val Phe 
515 520 525 

Arg Gin Glu Asp Ala Phe His Pro Arg Glu Leu Asn Ala Glu Pro 

535 540 
Ser Gin Met Gin He Pro Lys Glu Thr Lys Ala Glu Asp Glu Asp 
545 550 555 

He Val Leu Thr Pro Asp Gly Thr Arg Glu Phe Leu Thr Phe Glu 

560 565 
Val Pro Leu Asn Asp Ser Gly Ser Ala Gly Leu Gly Val Ser Val 
575 580 535 

Lys Gly Asn Arg Ser Lys Glu Asn His Ala Asp Leu Gly He Phe 
590 595 gQ„ 

Val Lys Ser He He Asn Gly Gly Ala Ala Ser Lys Asp Gly Arg 

610 615 
Leu Arg Val Asn Asp Gin Leu He Ala Val Asn Gly Glu Ser Leu 

625 630 
Leu Gly Lys Thr Asn Gin Asp Ala Met Glu Thr Leu Arg Arg Ser 
635 640 fi4c 

Met Ser Thr Glu Gly Asn Lys Arg Gly Met He Gin Leu He Val 
650 655 660 

Ala Arg Arg He Ser Lys Cys Asn Glu Leu Lys Ser Pro Gly Ser 
665 670 57S 

Pro Pro Gly Pro Glu Leu Pro He Glu Thr Ala Leu Asp Asp Arg 
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r,-, , 685 690 

Glu Arg Arg He Ser His Ser Leu Tyr Ser Gly He Glu Gly Leu 

695 700 
Asp Glu Ser Pro Ser Arg Asn Ala Ala Leu Ser Arg He Met Gly 

'^10 715 720 

Glu Ser Gly Lys Tyr Gin Leu Ser Pro Thr Val Asn Met Pro Gin 

''^^ 730 735 

Asp Asp Thr val He He Glu Asp Asp Arg Leu Pro Val Leu Pro 

"^^O 745 750 

Pro His Leu Ser Asp Gin Ser Ser Ser Ser Ser His Asp Asp Val 

755 750 
Gly Phe Val Thr Ala Asp Ala Gly Thr Trp Ala Lys Ala Ala He 

775 ^gg 

ser Asp Ser Ala Asp Cys Ser Leu Ser Pro Asp Val Asp Pro Val 

"^^^ 790 795 

Leu Ala Phe Gin Arg Glu Gly Phe Gly Arg Gin Ser Met Ser Glu 

800 805 am 

Lys Arg Thr Lys Gin Phe Ser Asp Ala Ser Gin Leu Asp Phe Val 

815 820 825 

Lys Thr Arg Lys Ser Lys Ser Met Asp Leu Gly lie Ala Asp Glu 
mt, r 835 840 

Thr Lys Leu Asn Thr Val Asp Asp Gin Lys Ala Gly Ser Pro Ser 

850 855 
Arg Asp Val Gly Pro Ser Leu Gly Leu Lys Lys Ser Ser Ser Leu 

860 865 870 

Glu Ser Leu Gin Thr Ala Val Ala Glu Val Thr Leu Asn Gly Asp 

875 880 ggr 

He Pro Phe His Arg Pro Arg Pro Arg He He Arg Gly Arg Gly 
r. , , 895 900 

Cys Asn Glu Ser Phe Arg Ala Ala He Asp Lys Ser Tyr Asp Lys 
905 910 qis 

Pro Ala Val Asp Asp Asp Asp Glu Gly Met Glu Thr Leu Glu Glu 
920 925 930 

Asp Thr Glu Glu Ser Ser Arg Ser Gly Arg Glu Ser Val Ser Thr 

n 940 945 

Ala Ser Asp Gin Pro Ser His Ser Leu Glu Arg Gin Met Asn Gly 

950 955 
Asn Gin Glu Lys Gly Asp Lys Thr Asp Arg Lys Lys Asp Lys Thr 
965 970 gjr 

Gly Lys Glu Lys Lys Lys Asp Arg Asp Lys Glu Lys Asp Lys Met 
, 985 990 

Lys Ala Lys Lys Gly Met Leu Lys Gly Leu Gly Asp Met Phe Arg 
995 1000 1005 

lie Gin Ala Lys Thr Arg Glu Phe Arg Glu Arg Gin Ala Arg Glu 
1010 1015 Tnon 

Arg Asp Tyr Ala Glu He Gin Asp Phe His Arg Thr Phe Gly Cys 

1030 1035 
Asp Asp Glu Leu Met Tyr Gly Gly Val Ser Ser Tyr Glu Gly Ser 
1040 1045 insn 

Met Ala Leu Asn Ala Arg Pro Gin Ser Pro Arg Glu Gly His Met 
V, ^ , , 1060 106S 

Met Asp Ala Leu Tyr Ala Gin Val Lys Lys Pro Arg Asn Ser Lys 
_ _ 1070 1075 -[^Qg„ 

Pro Ser Pro Val Asp Ser Asn Arg Ser Thr Pro Ser Asn His Asp 
^'^^^ 1090 1095 

Arg He Gin Arg Leu Arg Gin Glu Phe Gin Gin Ala Lys Gin Asp 

m ^-^^^ 1105 1110 

Glu Asp Val Glu Asp Arg Arg Arg Thr Tyr Ser Phe Glu Gin Pro 

■^^^^ 1120 1125 

Trp Pro Asn Ala Arg Pro Ala Thr Gin Ser Gly Arg His Ser Val 

1130 1135 
Ser val Glu Val Gin Met Gin Arg Gin Arg Gin Glu Glu Arg Glu 

1145 1150 1155 
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Ser Ser Gin Gin Ala Gin Arg Gin Tyr Ser Ser Leu Pro Arg Gin 

1165 1170 
Ser Arg Lys Asn Ala Ser Ser Val Ser Gin Asp Ser Trp Glu Gin 
^ ■'■■'•'^^ 1180 118S 

Asn Tyr Ser Pro Gly Glu Gly Phe Gin Ser Ala Lys Glu Asn Pro 

1190 1195 -1200 

Arg Tyr Ser Ser Tyr Gin Gly Ser Arg Asn Gly Tyr Leu Gly Gly 
„• , 1210 1215 

His Gly Phe Asn Ala Arg Val Met Leu Glu Thr Gin Glu Leu Leu 

1220 1225 1230 

Arg Gin Glu Gin Arg Arg Lys Glu Gin Gin Met Lys Lys Gin Pro 

1235 1240 1245 

Pro Ser Glu Gly Pro Ser Asn Tyr Asp Ser Tyr Lys Lys Val Gin 

■^^^^ 1255 1260 

Asp Pro Ser Tyr Ala Pro Pro Lys Gly Pro Phe Arg Gin Asp Val 

■"•^^^ 1270 1275 

Pro Pro ser Pro Ser Gin Val Ala Arg Leu Asn Arg Leu Gin Thr 

1285 1290 

Pro Glu Lys Gly Arg Pro Phe Tyr Ser 
1295 

<210> 3 
<211> 433 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7502420CD1 

<400> 3 

Met Glu Ala Cys Ser Pro Cys Arg Ser Glu Thr Tyr Glu Leu Gin 
5 10 ic 

Leu Pro Ala Ser lie Gly Arg Pro Arg Arg Asn Cys Gly Gin Pro 

He Val Pro Glu Gin Gly Lys Pro Pro Pro Ser Gly Ser Val Gin 
35 40 .c 

Gly Leu Arg Gin Trp Gin Ser Ser His Arg Arg Ala Cys Gly Leu 
50 55 

Gly Cys Glu Ala Arg Ala Gly Gly Gly Glu Glu Pro Arg Gly Arg 

Ala Ser Ser Val Ala Gly Trp Val Gly A^a Phe Arg Ala Pro Phe 
80 35 rtrt 

He Glu Ala Ala Val Ala Gly Leu Gly Ala Gly Ser Gly Lys Arg 

Arg Arg Gly Trp Lys Met Pro Val His Ser Arg Gly Asp Lys III 

115 ion 
Glu Thr Asn His His Asp Glu Met Glu Val Asp Tyr Ala Glu Asn 
125 130 -jTc 

Glu Gly Ser Ser Ser Glu Asp Glu Asp Thr Glu Ser Ser Ser Val 

145 icn 

Ser Glu Asp Gly Asp Ser Ser Glu Met Asp Asp Glu Asp Cys Glu 

160 165 
Arg Arg Arg Met Glu Cys Leu Asp Glu Met Ser Asn Leu Glu Lys 

170 175 ign 

Gin Phe Thr Asp Leu Lys Asp Gin Leu Tyr Lys Glu Arg Leu Ser 

190 195 
Gin Val Asp Ala Lys Leu Gin Glu Val He Ala Gly Lys Ala Pro 
m nv 205 210 

Glu Tyr Leu Glu Pro Leu Ala Thr Leu Gin Glu Asn Met Gin He 

Arg Thr Lys Val Ala Gly He Tyr Arg Glu Leu Cys Leu Glu Ser 
230 235 240 
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Lys 


Asn 


Lys 


Tyr 




Cys 


V7JLU i.xe 


















Cys 




Ser 




Lys 


Leu 


Leu 


jjeu lyr 










^ o u 








T 

1/611 


Pin 


ijXU 


Lys 


j.i.e 




Arg 


Leu Glu 










OIK 








Asp 


±xe 


Thr 


Ser 




Lsu 


Trp 


Asn Asp 










0 Q n 








Lys 




Lys 


Asp 


Pro 




Ser 


Pro Asp 










305 








Val 


Ser 


Gly 


Pro 


Tyr 


lie 


Val 


Tyr Met 










J z U 








Leu 


GiU 


Asp 


Trp 


Thr 


Thr 


He 


Arg Lys 










T "3 c: 








Pro 


His 


Arg Val 


Lys 


Thr 


Glu 


Pro Pro 










OCA 








Leu 


His 


Ser 


Ala 


Arg 


Ser 


Glu 


Glu Gly 










365 








Glu 


Trp 


Tyr 


He 


Arg 
380 


Gly 


Gin 


Thr He 


Glu 


Cys 


Pro 


Thr 


Ser 


Ala 


Val 


He Thr 










395 








Val 


Trp 


Phe 


Lys 


Arg 


Pro 


Asp 


Gly Ser 










410 








Ser 


Gin 


Leu 


Gin 


Lys 


Gly 


Lys 


Tyr Ser 










425 









<210> 4 

<211> 199 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> inisc_f eature 

<223> Incyte ID No: 7506455CD1 



<400> 4 
















Met 


Glu 


Cys 


Arg 


Lys 


Met 


Asn 


Gly Arg 


1 








5 










Asn 


Ala 


Glu 


Val 


Pro 


Arg 


Pro 


He 


Pro 










20 










He 


Pro 


Thr 


Glu 


Glu 


Glu 


Arg 


Arg Val 










35 










Glu 


Ser 


Phe 


Trp 


Phe 


Arg 


Ser 


Val 


Pro 










50 










Leu 


He 


Thr 


Gin 


Gly 
65 


Leu 


He 


Ser 


Lys 


Pro 


Lys 


Tyr 


Gly 


Ser 
80 


He 


Pro 


Lys 


Leu 


Gly 


Tyr 


Phe 


Ala 


Gly 


Lys 


Leu 


Ser 


Tyr 










95 










Lys 


Phe 


Lys 


Lys 


Leu 


Glu 


Asn 


Ser 


Pro 










110 










Ser 


Gly 


Gin 


Ala 


Arg 


Arg 


Ser 


Ser 


Pro 










125 










Lys 


Ser 


Lys 


Tyr 


Asp 


Ser 


Ser 


Val 


Ser 










140 










Thr 


Ser 


Pro 


Ala 


Ala 


Asp 


Asn 


He 


Glu 










155 










Pro 


He 


Pro 


Phe 


Ser 


Ser 


Ser 


Met 


Asn 










170 










He 


Thr 


Asp 


His 


He 


Val 


Gin 


Val 


Lys 










185 
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VtXII 


al sa 


Ser Arg 


vjxn 


xixs 














Asp 


Thr 


Val 


Gin 


Ser 


/^l 11 
VjXU 


zoo 










o'Tn 
f \} 


Pi 11 

l3±U 


Asp 


Arg His 


Ser 


Tl a 

xxe 


Z O W 










Z O D 




Leu 


Gin 


Ser 


Arg 


Lys 


^ J J 












Lys 


Lys 


Lys 


Pro 


VaX 


val 


Ti n 
oxu 










j1-> 


Leu 


Gin 


Asp 


Leu 


Asp 


lie 


DOC 










O O A 


Ala 


Met 


Ala 


Thr 


Leu 


Gly 












3 4D 


Val 


Lys 


Leu 


Glu 


Lys 


His 


o c c 

355 










360 


Arg 


Leu 


Tyr Tyr 


Asp 


Gly 


370 










375 


Cys 


He 


Asp Lys 


Lys 


Asp 


385 










390 


Thr 


He 


Asn His 


Asp 


Glu 


400 










405 


Lys 


Ser 


Lys 


Leu 


Tyr 


He 


415 










420 


He 


Lys 


His 


Ser 






430 












Ala 


Asp 


Phe Arg 


Glu 


Pro 


10 










15 


His 


He 


Gly Pro 


Asp 


Tyr 


25 










30 


Phe 


Ala 


Glu 


Cys 


Asn 


Asp 


40 










45 


Leu 


Ala 


Ala 


Thr 


Ser 


Met 


55 










60 


Gly 


He 


Leu 


Ser 


Ser 


His 


■T A 
/U 










75 


He 


Leu 


Ala 


Cys 


He 


Met 


o c 
OO 










n A 

90 


Val 


Lys 


Thr 


Cys 


Gin 


Glu 


1 A A 
100 










t A C 

X05 


Leu 


Gly 


Glu 


Ala 


Leu 


Arg 


lie: 
J.1D 










Ton 
X^U 


Pro 


Gly 


His 


Tyr 


Tyr 


Gin 


130 










135 


Gly 


Gin 


Ser 


Ser 


Phe 


Val 


145 










150 


Met 


Leu 


Pro 


His 


Tyr 


Glu 


160 










165 


Glu 


Ser 


Ala 


Pro 


Thr 


Gly 


175 










180 


Val 


Asn 


Lys 


Tyr 


Gly 


Asp 


190 










195 
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Thr Trp Asp Glu 



<210> 5 
<211> 37 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> inisc_f eature 

<223> Incyte ID No: 7506018CD1 

<400> 5 

Met Ser Gly Leu Arg Val Tyr Ser Thr Ser Val Thr Gly Ser Arg 
1 5 . 10 15 

Glu Asp Tyr Glu Leu Phe Val Glu Ala Val Glu Gin Asn Thr Leu 

20 25 30 

Gin Glu Phe Leu Lys Leu Ala 

35 

<210> 6 
<211> 312 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 353017CD1 

<400> 6 



Met 


Ala 


Leu 


Leu 


Tyr 


Glu 


Gly 


Asp He Ser Arg Gly He Ala 


Asp 


1 








5 






10 




15 


Gin 


Glu 


He 


He 


Ala 


Gly 


Gly 


Asp Val Lys Tyr Thr Thr 


Trp 


Met 










20 






25 




30 


Phe 


Glu 


Thr 


Gin 


Ala 


He 


Asp 


Thr Leu Gly Ala His Ser 


Ser 


Asp 










35 






40 




45 


Thr 


Val 


Glu 


Asn 


Ala 


Glu 


Lys 


He Pro Glu Leu Ala Arg 


Gly Asp 










50 






55 




60 


Val 


Cys 


Thr 


Ala 


Arg 


Trp 


Met 


Phe Glu Thr Arg Pro Leu 


Asp 


Ser 










65 






70 




75 


Met 


Asn 


Lys 


Met 


His 


Gin 


Ser 


Gin Glu Glu Ser Ala Val 


Thr 


He 










80 






85 




90 


Ser 


Lys 


Asp 


He 


Thr 


Gly 


Gly 


Asp Val Lys Thr Val Arg Tyr Met 










95 






100 




105 


Phe 


Glu 


Thr 


Gin 


His 


Leu 


Asp 


Gin Leu Gly Gin Leu His 


Ser 


Val 










110 






115 




120 


Asp 


Glu 


Val 


His 


Leu 


Leu 


Gin 


Leu Arg Ser Glu Leu Lys 


Glu 


He 










125 






130 




135 


Lys 


Gly 


Asn 


Val 


Lys 


Arg 


Ser 


He Lys Cys Phe Glu Thr 


Gin 


Pro 










140 






145 




150 


Leu 


Tyr 


Val 


He 


Arg 


Asp 


Gly 


Ser Gly Gin Met Leu Glu 


He 


Lys 










155 






160 




165 


Thr 


Val 


His 


Arg 


Glu 


Asp 


Val 


Glu Lys Gly Asp Val Arg Thr Ala 










170 






175 




180 


Arg 


Trp 


Met 


Phe 


Glu 


Thr 


Gin 


Pro Leu Asp Thr He Asn 


Lys 


Asp 










185 






190 




195 


lie 


Thr 


Glu 


He 


Lys 


Val 


Val 


Arg Gly He Ser Met Glu 


Glu 


Asn 










200 






205 




210 


Val 


Lys 


Gly 


Gly 


Val 


Ser 


Lys 


Ala Lys Trp Leu Phe Glu 


Thr 


Gin 










215 






220 




225 


Pro 


Leu 


Glu 


Lys 


He 


Lys 


Glu 


Ser Glu Glu Val He He 


Glu 


Lys 










230 






235 




240 


Glu 


Lys 


He 


He 


Gly 


Thr 


Asp 


Val Ser Arg Lys Cys Trp 


Met 


Phe 
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245 250 255 

Glu Thr Gin Pro Leu Asp lie Leu Lys Glu Val Pro Asp Ala Asp 

260 265 270 

Ser Leu Gin Arg Glu Glu lie lie Gly Gly Asp Val Gin Thr Thr 

275 280 285 

Lys His Leu Phe Glu Thr Leu Pro lie Glu Ala Leu Lys Asp Ser 

290 295 300 

Pro Asp lie Gly Lys Leu Gin Lys Asn His Cys Leu 

305 310 

<210> 7 

<211> 993 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> itiisc_f eature 

<223> Incyte ID No: 5137816CD1 

<400> 7 

Met Glu Lys Gin Leu Ser Pro Arg Met Gin Asn Asn Glu Glu Leu 
15 10 15 

Pro Thr Tyr Glu Glu Ala Lys Val Gin Ser Gin Tyr Phe Arg Gly 
20 25 30 

Gin Gin His Ala Ser Val Gly Ala Ala Phe Tyr Val Thr Gly Val 
35 40 45 

Thr Asn Gin Lys Met Arg Thr Glu Gly Arg Pro Ser Val Gin Arg 
50 55 60 

Leu Asn Pro Gly Lys Met His Gin Asp Glu Gly Leu Arg Asp Leu 
65 70 75 

Lys Gin Gly His Val Arg Ser Leu Ser Glu Arg Leu Met Gin Met 
80 85 90 

Ser Leu Ala Thr Ser Gly Val Lys Ala His Pro Pro Val Thr Ser 
95 100 105 

Ala Pro Leu Ser Pro Pro Gin Pro Asn Asp Leu Tyr Lys Asn Pro 
110 115 120 

Thr Ser Ser Ser Glu Phe Tyr Lys Ala Gin Gly Pro Leu Pro Asn 
125 130 135 

Gin His Ser Leu Lys Gly Met Glu His Arg Gly Pro Pro Pro Glu 
140 145 150 

Tyr Pro Phe Lys Gly Met Pro Pro Gin Ser Val Val Cys Lys Pro 
155 160 165 

Gin Glu Pro Gly His Phe Tyr Ser Glu His Arg Leu Asn Gin Pro 
170 175 180 

Gly Arg Thr Glu Gly Gin Leu Met Arg Tyr Gin His Pro Pro Glu 
185 190 195 

Tyr Gly Ala Ala Arg Pro Ala Gin Asp He Ser Leu Pro Leu Ser 
200 205 210 

Ala Arg Asn Ser Gin Pro His Ser Pro Thr Ser Ser Leu Thr Ser 
215 220 225 

Gly Gly Ser Leu Pro Leu Leu Gin Ser Pro Pro Ser Thr Arg Leu 
230 235 240 

Ser Pro Ala Arg His Pro Leu Val Pro Asn Gin Gly Asp His Ser 
245 250 255 

Ala His Leu Pro Arg Pro Gin Gin His Phe Leu Pro Asn Gin Ala 
260 265 270 

His Gin Gly Asp His Tyr Arg Leu Ser Gin Pro Gly Leu Ser Gin 
275 280 285 

Gin Gin Gin Gin Gin Gin Gin Gin His His His His His His His 
290 295 300 

Gin Gin Gin Gin Gin Gin Gin Pro Gin Gin Gin Pro Gly Glu Ala 
305 310 315 

Tyr Ser Ala Met Pro Arg Ala Gin Pro Ser Ser Ala Ser Tyr Gin 
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"^^'^ 325 330 

Pro Val Pro Ala Asp Pro Phe Ala lie Val Ser Arg Ala Gin Gin 

335 34Q 
Met val Glu lie Leu Ser Asp Glu Asn Arg Asn Leu Arg Gin Glu 

Leu Glu Gly Cys Tyr Glu Lys Val Ala Arg Leu Gin Lys Val Glu 

370 TTc 
Thr Glu He Gin Arg Val Ser Glu Ala Tyr Glu Asn Leu Val Lys 

385 390 
Ser ser Ser Lys Arg Glu Ala Leu Glu Lys Ala Met Arg Asn Lys 

400 4nR 
Leu Glu Gly Glu He Arg Arg Met His Asp Phe Asn Arg Asp Leu 
^■^^ 415 420 

Arg Glu Arg Leu Glu Thr Ala Asn Lys Gin Leu Ala Glu Lys Glu 
m. , 430 435 

Tyr Glu Gly Ser Glu Asp Thr Arg Lys Thr He Ser Gin Leu Phe 

445 450 
Ala Lys Asn Lys Glu Ser Gin Arg Glu Lys Glu Lys Leu Glu Ala 
r.-, ^ , 460 465 

Glu Leu Ala Thr Ala Arg Ser Thr Asn Glu Asp Gin Arg Arg His 
470 475 .gn 

He Glu He Arg Asp Gin Ala Leu Ser Asn Ala Gin Ala Lys Val 
485 490 405 

Val Lys Leu Glu Glu Glu Leu Lys Lys Lys Gin Val Tyr Val Asp 
500 505 pT-in 

Lys Val Glu Lys Met Gin Gin Ala Leu Val Gin Leu Gin Ala Ala 
515 520 525 

Cys Glu Lys Arg Glu Gin Leu Glu His Arg Leu Arg Thr Arg Leu 
r,-, , , ^-^^ 535 540 

Glu Arg Glu Leu Glu Ser Leu Arg He Gin Gin Arg Gin Gly Asn 

550 555 
Cys Gin Pro Thr Asn Val Ser Glu Tyr Asn Ala Ala Ala Leu Met 
^1 r 565 570 

Glu Leu Leu Arg Glu Lys Glu Glu Arg He Leu Ala Leu Glu Ala 
^"^^ 580 585 

Asp Met Thr Lys Trp Glu Gin Lys Tyr Leu Glu Glu Asn Val Met 

590 595 
Arg His Phe Ala Leu Asp Ala Ala Ala Thr Val Ala Ala Gin Arg 

610 

Asp Thr Thr val He Ser His Ser Pro Asn Thr Ser Tyr Asp Th? 

^20 625 630 

Ala Leu Glu Ala Arg He Gin Lys Glu Glu Glu Glu He Leu Met 
AT , , 640 645 

Ala Asn Lys Arg Cys Leu Asp Met Glu Gly Arg He Lys Thr Leu 

650 655 gg- 

His Ala Gin He He Glu Lys Asp Ala Met He Lys Val Leu Gin 

665 670 675 

Gin Arg Ser Arg Lys Glu Pro Ser Lys Thr Glu Gin Leu Ser Cys 

Met Arg Pro Ala Lys Ser Leu Met Ser He Ser Asn Ala Gly ler 
r,-, . 700 705 

Gly Leu Leu Ser His Ser Ser Thr Leu Thr Gly Ser Pro He Met 

710 715 720 

Glu Glu Lys Arg Asp Asp Lys Ser Trp Lys Gly Ser Leu Gly He 
T '^^^ 730 735 

Leu Leu Gly Gly Asp Tyr Arg Ala Glu Tyr Val Pro Ser Thr Pro 

740 745 
ser Pro Val Pro Pro Ser Thr Pro Leu Leu Ser Ala His Ser Lys 

755 760 765 

Thr Gly Ser Arg Asp Cys Ser Thr Gin Thr Glu Arg Gly Thr Glu 

770 775 -,on 

Ser Asn Lys Thr Ala Ala Val Ala Pro He Ser Val Pro Ala Pro 

785 790 795 
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Val Ala Ala Ala Ala Thr Ala Ala Ala lie Thr Ala Thr Ala Ala 

300 805 810 

He He Thr Ser He Met Val Ala Ala Ala Pro Val Ala Val Ala 

820 825 
Ala Ala Ala Ala Pro Ala Ala Ala Ala Ala Pro Ser Pro Ala Thr 

830 835 840 

Ala Ala Ala Thr Ala Ala Ala Val Ser Pro Ala Ala Ala Gly Gin 

845 850 855 

lie Pro Ala Ala Ala Ser Val Ala Ser Ala Ala Ala Val Ala Pro 

860 865 870 

Ser Ala Ala Ala Ala Ala Ala Val Gin Val Ala Pro Ala Ala Pro 

875 880 885 

Ala Pro Val Pro Ala Pro Ala Leu Val Pro Val Pro Ala Pro Ala 

890 895 900 

Ala Ala Gin Ala Ser Ala Pro Ala Gin Thr Gin Ala Pro Thr Ser 

905 910 9-J.5 

Ala Pro Ala Val Ala Pro Thr Pro Ala Pro Thr Pro Thr Pro Ala 

920 925 
Val Ala Gin Ala Glu Val Pro Ala Ser Pro Ala Thr Gly Pro Gly 

935 940 945 

Pro His Arg Leu Ser He Pro Ser Leu Thr Cys Asn Pro Asp Lys 

955 960 
Thr Asp Gly Pro Val Phe His Ser Asn Thr Leu Glu Arg Lys Thr 

965 970 975 

Pro He Gin He Leu Gly Gin Glu Pro Asp Ala Glu Met Val Glu 

980 985 990 

Tyr Leu He 



<210> 8 

<211> 204 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> inisc_feature 

<223> Incyte ID No: 7502151CD1 

<400> 8 

Met Glu Ala Ala Gly Ser Pro Ala Ala Thr Glu Thr Gly Lys Tyr 
■'• ' 5 10 15 

He Ala Ser Thr Gin Arg Pro Asp Gly Thr Trp Arg Lys Gin Arg 
20 25 30 

Arg Val Lys Glu Gly Tyr Val Pro Gin Glu Glu Val Pro Val Tyr 
35 40 45 

Glu Asn Lys Tyr Val Lys Phe Phe Lys Ser Lys Pro Glu Leu Pro 
50 55 go 

Pro Gly Leu Ser Pro Glu Ala Thr Ala Pro Val Thr Pro Ser Arg 
65 70 75 

Pro Glu Gly Gly Glu Pro Gly Leu Ser Lys Thr Ala Lys Arg Asn 
80 85 90 

Leu Lys Arg Lys Glu Lys Arg Arg Gin Gin Gin Glu Lys Gly Glu 
95 100 105 

Ala Glu Ala Leu Ser Arg Thr Leu Asp Lys Glu Ser Leu Glu Glu 

110 115 
Thr Ala Gin Leu Pro Ser Ala Pro Gin Gly Ser Arg Ala Ala Pro 
125 130 135 

Thr Ala Ala Ser Asp Gin Pro Asp Ser Ala Ala Thr Thr Glu Lys 

145 150 
Ala Lys Lys He Lys Asn Leu Lys Lys Lys Leu Arg Gin Val Glu 

r,-, ■'■^^ 160 165 

Glu Leu Gin Gin Arg He Gin Ala Gly Glu Val Ser Gin Pro Ser 
1"^0 175 180 
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Lys Glu Gin hen Glu Lys Leu Ala Arg Arg Arg Ala Leu Glu Glu 
185 190 155 

Glu Leu Glu Asp Leu Glu Leu Gly Leu 
200 

<210> 9 
<211> 630 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_.f eature 

<223> Incyte ID No: 7505983CD1 

<400> 9 



Met 


Gly 


Arg 


Lys 


Leu 


Asp 


Pro 


Thr 


Lys 


Glu 


Lys Arg Gly Pro 


uiy 


1 








5 










1 n 
10 








Arg 


Lys 


Ala 


Arg 


Lys 


Gin 


Lys 


Gly Ala 


Glu 


Thr Glu Leu Val 


Arg 










20 










25 








Phe 


Leu 


Pro 


Ala 


Val 


Ser 


Asp 


Glu 


Asn 


Ser 


Lys Arg Leu 


Ser 


S©ir 










35 










40 








Arg Ala 


Arg 


Lys 


Arg 


Ala 


Ala 


Lys 


Arg 


Arg 


Leu Gly Ser 


Val 












50 










55 






bU 


Ala 


Pro 


Lys 


Thr 


Asn 


Lys 


Ser 


Pro 


Glu 


TV T 

Ala 


Lys Pro Leu 


Pro 


Gly 










65 










70 






/ D 


Lys 


Leu 


Pro 


Lys 


Gly 


lie 


Ser 


Ala 


Gly 


Aia 


Val Gin Thr 


Ala 


Gly 










80 










85 






o n 

y u 


Lys 


Lys 


Gly 


Pro 


Gin 


Ser 


Leu 


Phe 


Asn 


Ala 


Pro Arg Gly Lys 


Lys 










95 










100 






105 


Arg 


Pro 


Ala 


Pro 


Gly 


Ser 


Asp 


Glu 


Glu 


Glu 


Glu Glu Phe 


Leu 


GXU 










110 










115 






ton 


Ala 


Asn 


Glu 


Val 


Pro 


Arg 


Pro 


Val 


Thr 


Leu 


Arg Thr Asn 


Thr 


Leu 










125 










130 






1 *3 C 

X 


Lys 


Thr 


Arg 


Arg 


Arg 


Asp 


Leu 


Ala 


Gin 


Ala 


Leu He Asn Arg 


Cjiy 










140 










145 








Val 


Asn 


Leu 


Asp 


Pro 


Leu 


Gly 


Lys 


Trp 


Ser 


Lys Thr Gly Leu 


val 










155 










160 






loo 


Val 


Tyr 


Asp 


Ser 


Ser 


vaj. 


P3ro 


He Gly 




Thr Pro Glu 


Tyr 


Lsu 










170 










175 






180 


Ala Gly 


His 


Tyr 


Met 


Leu 


Gin 


Gly Ala 


Ser 


Ser Met Leu 


Pro 


Val 










185 










190 






195 


Met 


Ala 


Leu 


Ala 


Pro 


Gin 


Glu 


His 


Glu 


Arg 


He Leu Asp 


Met 


Cys 










200 










205 






210 


Cys 


Ala 


Pro 


Gly 


Gly 


Lys 


Thr 


Ser 


Tyr 


Met 


Ala Gin Leu 


Met 


Lys 










215 










220 






225 


Asn 


Thr 


Gly 


Val 


He 


Leu 


Ala 


Asn 


Asp 


Ala 


Asn Ala Glu 


Arg 


Leu 










230 










235 






240 


Lys 


Ser 


Val 


Val 


Gly 


Asn 


Leu 


His 


Arg 


Leu 


Gly Val Thr 


Asn 


Thr 










245 










250 






255 


He 


He 


Ser 


His 


Tyr 


Asp 


Gly 


Arg 


Gin 


Phe 


Pro Lys Val 


Val 


Gly 










260 










265 






270 


Gly 


Phe 


Asp 


Arg 


Val 


Leu 


Leu 


Asp Ala 


Pro 


Cys Ser Gly Thr 


Gly 










275 










280 






285 


Val 


He 


Ser 


Lys 


Asp 


Pro 


Ala 


Val 


Lys 


Thr 


Asn Lys Asp 


Glu 


Lys 










290 










295 






300 


Asp 


He 


Leu 


Arg 


Cys 


Ala 


His 


Leu 


Gin 


Lys 


Glu Leu Leu 


Leu 


Ser 










305 










310 






315 


Ala 


He 


Asp 


Ser 


Val 


Asn 


Ala 


Thr 


Ser 


Lys 


Thr Gly Gly Tyr 


Leu 










320 










325 






330 


Val 


Tyr 


Cys 


Thr 


Cys 


Ser 


He 


Thr 


Val 


Glu 


Glu Asn Glu 


Trp 


Val 










335 










340 






345 


Val 


Asp 


Tyr 


Ala 


Leu 


Lys 


Lys 


Arg Asn 


Val 


Arg Leu Val 


Pro 


Thr 



350 355 360 



13/89 



PCT/US02/39133 

Gly Leu Asp Phe Gly Gin Glu Gly Phe Thr Arg Phe Arg Glu Arg 
Arg Phe His Pro Ser .eu Arg Ser .hr Arg Phe Tyr Pro JIs 
Thr His Asn Met Asp Gly Phe Phe He III l.ys Phe Lys Lys Phe 
Ser Asn Ser He Pro Gin Ser Gin Thr oJy Asn Ser Glu Thr til 
Thr Pro Thr Asn Val Asp Leu Pro Gin ill xle Pro Lys Ser Glu 



Asn ser Ser Gin Pro Ala Lys Lys Ala £s Gly Ala Ala Lys Sr 



Lys Gin Gin Leu Gin Lys Gin Gin His Pro Lys Lys Ala Ser Phe 



455 



am Lys Leu Asn Gly xle Ser Lys Gly Z Asp Ser Glu Leu sfr 



470 



Thr val Pro Ser Val Thr Lys Thr Gin III Ser Ser Ser Phe Gl'n 



485 



ASP ser Ser Gin Pro Ala Gly Lys Ala X Gly lie Arg Glu ill 
I-ys val Thr Gly Lys Leu Lys Gin Arg III Pro Lys Leu Gin Hi 
ser Lys Lys Val Ala Phe Leu Arg Gin Isn Ala Pro Pro Lys lly 
Thr ASP Thr Gin Thr Pro Ala Val Leu III Pro Ser Lys Thr 
Ala Thr Leu Lys Pro Lys Asp His His Gin Pro Leu Gly Arg III 
Lys Gly val Glu Lys Gin Gin Leu Pro g!u Gin Pro Phe Glu Lys 
Ala Ala Phe Gin Lys Gin Asn Asp Thr III Lys Gly Pro Gin III 



Pro Thr val Ser Pro Xle Arg Ser Ser Pro Pro Pro Ala Lys 



Arg Lys Lys Ser Gin Ser Arg Gly Asn s'er Gin Leu Leu Leu s'er 



630 



620 

<210> 10 

<211> 129 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7505986CD1 

<400> 10 

Met Gly Arg Lys Leu Asp Pro Thr Lys Glu Lys Arg Gly Pro Gly 
Arg Lys Ala Arg Lys Gin Lys Gly Ala Gl'u Thr Glu Leu Val Arg 
Phe Leu Pro Ala Val Ser Asp Glu Asn Ser Lys Arg Leu Ser sll 



Arg Ala Arg Lys Arg Ala Ala Lys Arg Arg Leu Gly Ser Val Gl'u 
Ala Pro Lys Thr Asn Lys Ser Pro Glu Al'a Lys Pro Leu Pro Gl'y 
Lys Leu Pro Lys Gly lie Ser Ala Gly Al'a Val Gin Thr Ala Gl'y 
Lys Lys Gly Pro Gin Ser Leu Phe Asn £ Pro Arg Gly Lys Lys 
Arg Pro Leu Ser Pro Gly Trp Asn Ser '^l .eu Cys Glu Asp III 
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110 115 120 

Phe Ser Thr Val His Thr His Glu lie 
125 

<210> 11 

<211> 607 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 3231075CD1 



<400> 11 


























Met 


Val 


Thr 


Asn 


Lys 


Met 


Thr 


Ala 


Ala 


Phe 


Arg 


Asn 


Pro 


Ser 


Gly 


1 








5 










10 










15 


Lys 


Gin 


Val 


Ala 


Thr- 
20 


Asp 


Lys 


Val 


Ala 


Glu 
25 


Lys 


Leu 


Ser 


Ser 


Thr 
30 


Leu 


Ser 


Trp 


Val 


Lys 
35 


Asn 


Thr 


Val 


Ser 


His 
40 


Thr 


Val 


Ser 


Gin 


Met 
45 


Ala 


Ser 


Gin 


Val 


Ala 
50 


Ser 


Pro 


Ser 


Thr 


Ser 
55 


Leu 


His 


Thr 


Thr 


Ser 
60 


Ser 


Ser 


Thr 


Thr 


Leu 
65 


Ser 


Thr 


Pro 


Ala 


Leu 
70 


Ser 


Pro 


Ser 


Ser 


Pro 
75 


Ser 


Gin 


Leu 


Ser 


Pro 
80 


Asp 


Val 


Leu 


Glu 


Leu 
85 


Leu 


Ala 


Lys 


Leu 


Glu 
90 


Glu 


Gin 


Asn 


He 


Leu 
95 


Leu 


Glu 


Thr 


Asp 


Ser 
100 


Lys 


Ser 


Leu 


Arg 


Ser 
105 


Val 


Asn 


Gly 


Ser 


Arg 
110 


Arg 


Asn 


Ser 


Gly 


Ser 

115 


Ser 


Leu 


Val 


Ser 


Ser 

120 


Ser 


Ser 


Ala 


Ser 


Ser 

125 


Asn 


Leu 


Ser 


His 


Leu 

130 


Glu 


Glu 


Asp 


Ser 


Trp 

135 


He 


Leu 


Trp 


Gly 


Arg 
140 


He 


Val 


Asn 


Glu 


Trp 

145 


Glu 


Asp 


Val 


Arg 


Lys 
150 


Lys 


Lys 


Glu 


Lys 


Gin 

155 


Val 


Lys 


Glu 


Leu 


Val 
160 


His 


Lys 


Gly 


He 


Pro 
165 


His 


His 


Phe 


Arg 


Ala 
170 


He 


Val 


Trp 


Gin 


Leu 
175 


Leu 


Cys 


Ser 


Ala 


Gin 
180 


Ser 


Met 


Pro 


He 


Lys 
185 


Asp 


Gin 


Tyr 


Ser 


Glu 
190 


Leu 


Leu 


Lys 


Met 


Thr 
195 


Ser 


Pro 


Cys 


Glu 


Lys 
200 


Leu 


He 


Arg 


Arg 


Asp 
205 


He 


Ala 


Arg 


Thr 


Tyr 
210 


Pro 


Glu 


His 


Asn 


Phe 
215 


Phe 


Lys 


Glu 


Lys 


Asp 
220 


Ser 


Leu 


Gly 


Gin 


Glu 
225 


Val 


Leu 


Phe 


Asn 


Val 
230 


Met 


Lys 


Ala 


Tyr 


Ser 
235 


Leu 


Val 


Asp 


Arg 


Glu 
240 


Val Gly 


Tyr 


Cys 


Gin 


Gly 


Ser 


Ala 


Phe 


He 


Val 


Gly 


Leu 


Leu 


Leu 










245 










250 










255 


Met 


Gin 


Met 


Pro 


Glu 

260 


Glu 


Glu 


Ala 


Phe 


Cys 
265 


Val 


Phe 


Val 


Lys 


Leu 
270 


Met 


Gin 


Asp 


Tyr 


Arg 
275 


Leu 


Arg 


Glu 


Leu 


Phe 
280 


Lys 


Pro 


Ser 


Met 


Ala 
285 


Glu 


Leu 


Gly 


Leu 


Cys 
290 


Met 


Tyr 


Gin 


Phe 


Glu 
295 


Cys 


Met 


He 


Gin 


Glu 
300 


His 


Leu 


Pro 


Glu 


Leu 

305 


Phe 


Val 


His 


Phe 


Gin 

310 


Ser 


Gin 


Ser 


Phe 


His 

315 


Thr 


Ser 


Met 


Tyr 


Ala 
320 


Ser 


Ser 


Trp 


Phe 


Leu 
325 


Thr 


He 


Phe 


Leu 


Thr 
330 


Thr 


Phe 


Pro 


Leu 


Pro 
335 


Val 


Ala 


Thr 


Arg 


He 
340 


Phe 


Asp 


He 


Phe 


Met 
345 


Ser 


Glu 


Gly 


Leu 


Glu 
350 


He 


Val 


Phe 


Arg 


Val 

355 


Gly 


Leu 


Ala 


Leu 


Leu 
360 


Gin Met 


Asn 


Gin 


Ala 


Glu 


Leu 


Met 


Gin 


Leu 


Asp 


Met 


Glu 


Gly 


Met 
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365 370 375 


Leu 


Gin 


His 


Phe 


Gin Lys Val lie Pro His Gin Phe Asp Gly Val 
380 385 390 


Pro 


Asp 


Lys 


Leu 


He Gin Ala. Ala IVr* nln V;=»T Tt\r<=i Tvt- Acm Qa-v 

395 400 405 


Lys 


Lys 


Met 


Lys 


Lvs Leu Glu Lvs Glu Tvr Thr Thi- Tle> T.v« Vhr- 
410 415 420 


Lys 


Glu 


Met 


Glu 


Glu Gin Val Glu lie Lvs Arcr Tipn At-ct Th-r Pin 
425 430 435 


Asn 


Arg 


Leu 


Leu 


Lys Gin Arg lie Glu Thr Leu Glu Lys His Lys 
440 445 450 


Cys 


Ser 


Ser 


Asn 


Tyr Asn Glu Asp Phe Val Leu Gin Leu Glu Lys 
455 460 465 


Glu 


Leu 


Val 


Gin 


Ala Arg Leu Ser Glu Ala Glu Ser Gin Cys Ala 
470 475 480 


Leu 


Lvs 


Glu 


Met 


Gin Asp Lys Val Leu Asp lie Glu Lys Arg Asn 
485 490 495 


Asn 


Ser 


Leu 


Pro Ajsp Glu Asn Asn lie Ala Arg Leu Gin Glu Glu 










500 505 510 


Xieu 


lie 


Ala 


Val 


Lys Leu Arg Glu Ala Glu Ala lie Met Gly Leu 

515 520 525 


Lys 


Glu 


Leu 


Arg 


Gin Gin Val Lys Asp Leu Glu Glu His Trp Gin 
530 535 540 




His 


Leu 


Ala 


Arg Thr Thr Gly Arg Trp Lys Asp Pro Pro Lys 
545 550 555 


Lys 


Asn 


Ala 


Met 


Asn Glu Leu Gin Asp Glu Leu Met Thr lie Arg 
560 565 570 


Leu 


Arg 


Glu 


Ala 


Glu Thr Gin Ala Glu lie Arg Glu He Lys Gin 
575 580 585 


Arg 


Met 


Met 


Glu 


Met Glu Thr Gin Leu Val Glu Ser Ala Asp Val 
590 595 600 


Glu 


Pro 


Ala 


Tyr 


Met Lys Gly 
605 



<210> 12 

<211> 105 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7503516CD1 



<400> 12 

Met Gly Gly Ala Gly He Leu Leu Leu Leu Leu Ala Gly Ala Gly 
15 10 15 

Val Val Val Ala Trp Arg Pro Pro Lys Gly Lys Cys Pro Leu Arg 

20 25 30 

Cys Ser Cys Ser Lys Asp Ser Ala Leu Cys Glu Gly Ser Pro Asp 

35 40 45 

Leu Pro Val Ser Phe Ser Pro Thr Leu Leu Ser Leu Ser Leu Val 

50 55 60 

Arg Thr Gly Val Thr Gin Leu Lys Ala Gly Ser Phe Leu Arg He 

65 70 75 

Pro Ser Leu His Leu Leu Leu Phe Thr Ser Asn Ser Phe Ser Val 

80 85 90 

He Glu Asp Asp Ala Phe Ala Gly Leu Ser His Leu Gin Tyr Leu 

95 100 105 



<210> 13 
<211> 552 
<212> PRT 
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<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7506179CX)1 

<400> 13 

Met Ser Asn Glu Val Glu Thr Ser Ala Thr Asn Gly Gin Pro Asp 
15 10 15 

Gin Gin Ala Ala Pro Lys Ala Pro Ser Lys Lys Glu Lys Lys Lys 
20 25 30 

Gly Pro Glu Lys Thr Asp Glu Tyr Leu Leu Ala Arg Phe Lys Gly 
35 40 45 

Asp Gly Val Lys Tyr Lys Ala Lys Leu He Gly He Asp Asp Val 
50 55 60 

Pro Asp Ala Arg Gly Asp Lys Met Ser Gin Asp Ser Met Met Lys 
65 70 75 

Leu Lys Gly Met Ala Ala Ala Gly Arg Ser Gin Gly Gin His Lys 
80 85 90 

Gin Arg He Trp Val Asn He Ser Leu Ser Gly He Lys He He 
95 100 105 

Asp Glu Lys Thr Gly Val He Glu His Glu His Pro Val Asn Lys 
110 115 120 

He Ser Phe He Ala Arg Asp Val Thr Asp Asn Arg Ala Phe Gly 
125 130 135 

Tyr Val Cys Gly Gly Glu Gly Gin His Gin Phe Phe Ala He Lys 
140 145 15Q 

Thr Gly Gin Gin Ala Glu Pro Leu Val Val Asp Leu Lys Asp Leu 
155 160 165 

Phe Gin Val He Tyr Asn Val Lys Lys Lys Glu Glu Glu Lys Lys 

170 175 
Lys He Glu Glu Ala Ser Lys Ala Val Glu Asn Gly Ser Glu Ala 
185 190 195 

Leu Met He Leu Asp Asp Gin Thr Asn Lys Leu Lys Ser Gly Val 
200 205 210 

Asp Gin Met Asp Leu Phe Gly Asp Met Ser Thr Pro Pro Asp Leu 
215 220 225 

Asn Ser Pro Thr Ser Ser Ala Asn Asp Leu Leu Ala Ser Asp He 
230 235 240 

Phe Ala Pro Pro Val Ser Glu Pro Ser Gly Gin Ala Ser Pro Thr 
245 250 255 

Gly Gin Pro Thr Ala Leu Gin Pro Asn Pro Leu Asp Leu Phe Lys 
260 265 270 

Thr Ser Ala Pro Ala Pro Val Gly Pro Leu Val Gly Leu Gly Gly 
275 280 285 

Val Thr Val Thr Leu Pro Gin Ala Gly Pro Trp Asn Thr Ala Ser 
290 295 300 

Leu Val Phe Asn Gin Ser Pro Ser Met Ala Pro Gly Ala Met Met 
305 310 315 

Gly Gly Gin Pro Ser Gly Phe Ser Gin Pro Val He Phe Gly Thr 
320 325 330 

Ser Pro Ala Val Ser Gly Trp Asn Gin Pro Ser Pro Phe Ala Ala 
335 340 345 

Ser Thr Pro Pro Pro Val Pro Val Val Trp Gly Pro Ser Ala Ser 
350 355 360 

Val Ala Pro Asn Ala Trp Ser Thr Thr Ser Pro Leu Gly Asn Pro 
365 370 375 

Phe Gin Ser Asn He Phe Pro Ala Pro Ala Val Ser Thr Gin Pro 
380 385 390 

Pro Ser Met His Ser Ser Leu Leu Val Thr Pro Pro Gin Pro Pro 

395 400 
Pro Arg Ala Gly Pro Pro Lys Asp He Ser Ser Asp Ala Phe Thr 
410 415 420 
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Ala Leu Asp Pro Leu Gly Asp 

425 

Met Phe Lys Asp Phe Gin Leu 

440 

Arg Lys Gly Glu Gin Thr Ser 
455 

Ser Tyr Phe Asn Ser Lys Val 
470 

His Asp Asp Phe Asp Ala Asn 

485 

Pro Pro Lys Pro Ala Pro Arg 

500 

Ser Thr Asp Asn Ala Phe Glu 
515 

Gly Ser Ser Gin Ala Ser Val 
530 

Glu Met Tyr Arg Asp Pro Phe 
545 

<210> 14 
<211> 54 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_f eature 
<223> Incyte ID No: 1938744CD1 

<400> 14 

Met Ser Ala Asn Arg Arg Trp Trp Val 

1 5 
Cys Val Ser Glu Lys Leu Leu Arg Lys 
20 

Val Pro lie Gly Ala Val Tyr Thr Met 

35 

Arg Met Ala Gin Asp Ala Gly Glu Lys 
50 

<210> 15 

<211> 479 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 5557436CD1 

<400> 15 



Met Ser 


Ser 


Pro 


Leu 


Ala 


Ser 


Leu 


Ser 


Lys Thr Arg Lys Val Pro 


1 






5 










10 15 


Leu Pro 


Ser 


Glu 


Pro 


Met 


Asn 


Pro Gly Arg Arg Gly He Arg He 








20 










25 30 


Tyr Gly 


Asp 


Val 


Pro 
35 


Ala 


Ser 


His 


Asp 


Ser Glu Leu Met Ala Phe 
40 45 


Met Thr 


Arg 


Lys 


Leu 
50 


Trp 


Asp 


Leu 


Glu 


Gin Gin Val Lys Ala Gin 

55 60 


Thr Asp 


Glu 


He 


Leu 

65 


Ser 


Lys 


Asp 


Gin 


Lys He Ala Ala Leu Glu 
70 75 


Asp Leu 


Val 


Gin 


Thr 
80 


Leu 


Arg 


Pro 


His 


Pro Ala Glu Ala Thr Leu 
85 90 


Gin Arg 


Gin 


Glu 


Glu 
95 


Leu 


Glu 


Thr 


Met 


Cys Val Gin Leu Gin Arg 
100 105 


Gin Val 


Arg 


Glu 


Met 


Glu 


Arg 


Phe 


Leu 


Ser Asp Tyr Gly Leu Gin 



Lys Glu 


He 


Lys Asp Val 




Glu 




430 










435 


Arg Gin 


Pro 


Pro 




va.JL 




Ala 




445 










J w 


Ser Gly Thr Leu 


Ser 




Php> 


al a 

rA J, o. 




460 












Gly He 


Pro 


Gin 




ASH 


nXd. 


Asp 




475 










480 


Gin Leu 


Leu 


Asn 


Lys 


He 


Asn 


Glu 




490 










495 


Gin Val 


Ser 


Leu 


Pro 


Val 


Thr 


Lys 




505 










510 


Asn Pro 


Phe 


Phe 


Lys 


Asp 


Ser 


Phe 




520 










525 


Ala Ser 


Ser 


Gin 


Pro 


Val 


Ser 


Ser 




535 










540 


Gly Asn 


Pro 


Phe 


Ala 









Pro Pro Asp Asp Glu Asp 
10 15 

Thr Arg Glu Ser Pro Leu 
25 30 

Tyr Ser Asp Tyr Val Lys 
40 45 
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110 




lis 

J. X J 


ion 


Trp Val 


Gly 


Glu 


Pro 


Met 


f^oy ^xn ijj,u Asp oer LiXu. oQX ijys 


inr 








125 




X J u 


i. JO 


Val Ser 


Glu 


His 


Gly Glu 


-Tixy Afap *rp jyieu xxir Axa xiys Jjys 


Phe 








140 






1I>U 


Trp Lys 


Pro 


Gly Asp 


Ser 


T.aii 2il^ Di^i-^ Ti>-y-i r^T 11 T/aT DVi^ 

xjt=u Axd fro jrro uxu vax Asp jrne 


Asp 








155 




1 fin 

XDU 


ICC 


Arg Leu 


Leu 


Ala 


Ser 


Leu 


v:2xxi riap xieu oex v:jxu Xjeu vaj. vax 


(jIU 








170 




X / D 


Ton 


Gly Asp Thr 


Gin Val 


Thr 


Jriro vajL FiTo LjjLy vjriy aib. Arg Leu 


Arg 








185 




1 Q n 
X y u 


ly 3 


Thr Leu 


Glu 


Pro 


He 


Pro 


Leu Lys Leu Tyr Arg Asn Gly lie 


Met 








200 




one: 
z U J 


210 


Met Phe Asp 


Gly 


Pro 


Phe 


Gin Pro Phe Tyr Asp Pro Ser Thr 


Gin 








215 




o o n 


225 


Arg Cys 


Leu 


Arg Asp 


He 


Leu Asp Gly Phe Phe Pro Ser Glu 


Leu 








230 






240 


Gin Arg Leu 


Tyr 


Pro 


Asn 


vji.y vax pro Pne Lys vai ser Asp 


Leu 








245 




o c n 


255 


Arg Asn 


Gin 


Val 


Tyr 


Leu 


Glu Asp Gly Leu Asp Pro Phe Pro 


Gly 








260 




O C 


270 


Glu Gly Arg 


Val 


Val 


Gly 


Arg Gin Arg Met His Lys Ala Leu 


Asp 








275 




o o n 


285 


Arg Val 


Glu 


Glu 


His 


Pro 


tjiy ber Arg Met Txir Ala Glu Lys 


Phe 








290 




z y J 


300 


Leu Asn 


Arg 


Leu 


Pro 


Lys 


pne vai lie Arg Gin Gly Glu Val 


He 








305 




JIU 


315 


Asp lie 


Arg 


Gly 


Pro 


He 


Arg Asp Thr Leu Gin Asn Cys Cys 


Pro 








320 






330 


Leu Pro 


Ala 


Arg 


He 


Gin 


Glu ixe Vai Val Glu Thr Pro Thr 


Leu 








335 






345 


Ala Ala 


Glu 


Arg Glu Arg 


Ser Gin Glu Ser Pro Asn Thr Pro 


Ala 








350 




c c 


360 


Pro Pro 


Leu 


Ser 


Met 


Leu 


Arg He Lys Ser Glu Asn Gly Glu 


Gin 








365 




"3 T n 


375 


Ala Phe 


Leu 


Leu 


Met 


Met 


Gin Pro Asp Asn Thr He Gly Asp 


Val 








380 






390 


Arg Ala 


Leu 


Leu 


Ala 


Gin 


Ala Arg Val Met Asp Ala Ser Ala 


Phe 








395 




400 


405 


Glu He 


Phe 


Ser 


Thr 


Phe 


Pro Pro Thr Leu Tyr Gin Asp Asp 


Thr 








410 




415 


420 


Leu Thr 


Leu 


Gin 


Ala 


Ala 


Gly Leu Val Pro Lys Ala Ala Leu 


Leu 








425 




430 


435 


Leu Arg Ala 


Arg Arg Ala 


Pro Lys Ser Ser Leu Lys Phe Ser 


Pro 








440 




445 


450 


Gly Pro 


Cys 


Pro 


Gly 


Pro 


Gly Pro Gly Pro Ser Pro Gly Pro 


Gly 








455 




460 


465 


Pro Gly 


Ser 


Ser 


Pro 


Cys 


Pro Gly Pro Ser Pro Ser Pro Gin 










470 




475 





<210> 16 

<211> 295 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7506178CD1 

<4a0> 16 

Met Lys Lys Glu His Val Leu His Cys Gin Phe Ser Ala Trp Tyr 
15 10 15 

Pro Phe Phe Arg Gly Val Thr He Lys Ser Val He Leu Pro Leu 
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. 20 






25 




30 


Pro 


Gin 


Asn Val 


Lys 


Asp 


Tyr Leu Leu Asp Asp Gly Thr 


Leu Val 


Val 






35 






40 




45 


Ser 


Gly Arg Asp Asp 


Pro Pro TViT" 


His Ser Gin 


Pro 


Asp Ser 








50 






55 




60 


Asp 


Asp 


Glu Ala 


Glu 


Glu 


lie fil n Tm 


Ser Asp Asp Glu Asn Thr 


Ala 






65 






70 




75 


Thr 


Leu Thr 


Ala 


Pro 


CI 11 Plita Pttv 

\j J. U JTXICS JrjLKJ 


Glu Phe Ala Thr Lys Val 


Gin 






80 






85 




90 


Glu 


Ala He 


Asn 


Ser 


Leu Gly Gly 


Ser Val Phe 


Pro 


Lys Leu 








95 






ICQ 




105 


Asn Trp 


Ser Ala 


Pro 


Arg 


Asp Ala Tyr 


Trp He Ala 


Met 


Asn Ser 








110 






115 




120 


Ser 


Leu 


Lys Cys 


Lys 


Thr 


Leu Ser Asp 


He Phe Leu 


Leu 


Phe Lys 








125 






130 




135 


Ser 


Ser 


Asp Phe 


He 


Thr 


Arg Asp Phe 


Thr Gin Pro 


Phe 


He His 








140 






145 




150 


Cys 


Thr 


Asp Asp 


Ser 


Pro 


Asp Pro Cys 


He Glu Tyr 


Glu 


Leu Val 








155 






160 




165 


Leu Arg 


Lys Trp 


Cys 


Glu 


Leu He Pro Gly Ala Glu 


Phe 


Arg Cys 








170 






175 




180 


Phe 


Val 


Lys Glu Asn Lys 


Leu He Val 


Val Phe Asp 


He 


Tyr Arg 








185 






190 




195 


Asp 


Ser 


Arg Gly Lys 


Val 


Trp Leu He 


Asp Phe Asn 


Pro 


Phe Gly 








200 






205 




210 


Glu 


Val 


Thr Asp 


Ser 


Leu 


Leu Phe Thr 


Trp Glu Glu 


Leu 


He Ser 


Glu 






215 






220 




225 


Asn 


Asn Leu 


Asn 


Gly 


Asp Phe Ser Glu Val Asp Ala Gin Glu 








230 






235 




240 


Gin Asp 


Ser Pro 


Ala 


Phe 


Arg Cys Thr 


Asn Ser Glu 


Val 


Thr Val 


Gin 






245 






250 




255 


Pro 


Ser Pro 


Tyr 


Leu 


Ser Tyr Arg Leu Pro Lys 


Asp 


Phe Val 








260 






265 


270 


Asp 


Leu 


Ser Thr Gly Glu 


Asp Ala His 


Lys Leu He Asp Phe Leu 








275 






280 




285 


Lys 


Leu 


Lys Arg Asn 


Gin 


Gin Glu Asp Asp 












290 






295 







<210> 17 

<211> 214 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7506235CD1 

<400> 17 

Met Cys He He Phe Phe Lys Phe Asp Pro Arg Pro Val Ser Lys 
15 10 15 

Asn Ala Tyr Arg Leu He Leu Ala Ala Asn Arg Asp Glu Phe Tyr 

20 25 30 

Ser Arg Pro Ser Lys Leu Ala Asp Phe Trp Gly Asn Asn Asn Glu 

35 40 45 

He Leu Ser Gly Leu Asp Met Glu Glu Gly Lys Glu Gly Gly Thr 

50 55 60 

Trp Leu Gly He Ser Thr Arg Gly Lys Leu Ala Ala Leu Thr Asn 

65 70 75 

Tyr Leu Gin Pro Gin Leu Asp Trp Gin Ala Arg Gly Arg Gly Thr 

80 85 90 

Tyr Gly Leu Ser Asn Ala Leu Leu Glu Thr Pro Trp Arg Lys Leu 

95 100 105- 

Cys Phe Gly Lys Gin Leu Phe Leu Glu Ala Val Glu Arg Ser Gin 
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110 




Ala 


Leu 


Pro 


Lys 


Asp 


Val Leu 










125 




Asn 


Asn 


Glu 


Glu 


Ala 


Gin Leu 










140 




Gly 


Gly 


Glu 


Tyr 


Val 


Gin Pro 










155 




Cys 


Val 


Arg 


Cys 


Pro 


Gly Tyr 










170 




Leu 


Val 


Asp Ala 


Asp Gly His 










185 




Met 


Asp 


Lys 


Asp 


Leu 


Ser His 










200 




Thr 


Leu 


Gin 


Ser 











115 


120 


lie 


Ala 


Ser Leu Leu Asp Val 


Leu 






130 


135 


Pro 


Asp 


Pro Ala lie Glu Asp 


Gin 






145 


150 


Met 


Leu 


Ser Lys Tyr Ala Ala 


Val 






160 


165 


Gly Thr Arg Thr Asn Thr lie 


lie 






175 


180 


Val 


Thr 


Phe Thr Glu Arg Ser Met 






190 


195 


Trp 


Glu 


Thr Arg Thr Tyr Glu 


Phe 






205 


210 



<210> 18 

<211> 1596 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 1302184CD1 

<400> 18 

Met Glu Ser Arg Glu Thr Leu Ser Ser Ser Arg Gin Arg Gly Gly 
/ 5 10 15 

Glu Ser Asp Phe Leu Pro Val Ser Ser Ala Lys Pro Pro Ala Ala 
20 25 30 

Pro Gly Cys Ala Gly Glu Pro Leu Leu Ser Thr Pro Gly Pro Gly 
35 40 45 

Lys Gly lie Pro Val Gly Gly Glu Arg Met Glu Pro Glu Glu Glu 
50 55 60 

Asp Glu Leu Gly Ser Gly Arg Asp Val Asp Ser Asn Ser Asn Ala 
65 70 75 

Asp Ser Glu Lys Trp Val Ala Gly Asp Gly Leu Glu Glu Gin Glu 
80 85 90 

Phe Ser He Lys Glu Ala Asn Phe Thr Glu Gly Ser Leu Lys Leu 
95 100 105 

Lys He Gin Thr Thr Lys Arg Ala Lys Lys Pro Pro Lys Asn Leu 
110 115 120 

Glu Asn Tyr He Cys Pro Pro Glu He Lys He Thr He Lys Gin 
125 130 135 

Ser Gly Asp Gin Lys Val Ser Arg Ala Gly Lys Asn Ser Lys Ala 
140 145 150 

Thr Lys Glu Glu Glu Arg Ser His Ser Lys Lys Lys Leu Leu Thr 
155 160 165 

Ala Ser Asp Leu Ala Ala Ser Asp Leu Lys Gly Phe Gin Pro Gin 
170 175 180 

Ala Tyr Glu Arg Pro Gin Lys His Ser Thr Leu His Tyr Asp Thr 
185 190 195 

Gly Leu Pro Gin Asp Phe Thr Gly Asp Thr Leu Lys Pro Lys His 
200 205 210 

Gin Gin Lys Ser Ser Ser Gin Asn His Met Asp Trp Ser Thr Asn 
215 220 225 

Ser Asp Ser Gly Pro Val Thr Gin Asn Cys Phe He "Ser Pro Glu 
230 235 240 

Ser Gly Arg Glu Thr Ala Ser Thr Ser Lys He Pro Ala Leu Glu 
245 250 255 

Pro Val Ala Ser Phe Ala Lys Ala Gin Gly Lys Lys Gly Ser Ala 
260 265 270 

Gly Asn Thr Trp Ser Gin Leu Ser Asn Asn Asn Lys Asp Leu Leu 
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275 280 285 

Leu Gly Gly Val Ala Pro Ser Pro Ser Ser His Ser Ser Pro Ala 
290 295 300 

Pro Pro Ser Ser Ser Ala Glu Cys Asn Gly Leu Gin Pro Leu Val 
305 310 315 

Asp Gin Asp Gly Gly Gly Thr Lys Glu Pro Pro Glu Pro Pro Thr 
320 325 330 

Val Gly Ser Lys Lys Lys Ser Ser Lys Lys Asp Val He Ser Gin 
335 340 345 

Thr He Pro Asn Pro Asp Leu Asp Trp Val Lys Asn Ala Gin Lys 
350 355 360 

Ala Phe Asp Asn Thr Glu Gly Lys Arg Glu Gly Tyr Ser Ala Asp 
365 370 375 

Ser Ala Gin Glu Ala Ser Pro Ala Arg Gin Asn Val Ser Ser Ala 
380 385 390 

Ser Asn Pro Glu Asn Asp Ser Ser His Val Arg He Thr He Pro 
395 400 405 

He Lys Ala Pro Ser Leu Asp Pro Thr Asn His Lys Arg Lys Lys 
410 415 420 

Arg Gin Ser He Lys Ala Val Val Glu Lys He Met Pro Glu Lys 
425 430 435 

Ala Leu Ala Ser Gly He Thr Met Ser Ser Glu Val Val Asn Arg 
440 445 450 

He Leu Ser Asn Ser Glu Gly Asn Lys Lys Asp Pro Arg Val Pro 
455 460 465 

Lys Leu Ser Lys Met He Glu Asn Glu Ser Pro Ser Val Gly Leu 
470 475 480 

Glu Thr Gly Gly Asn Ala Glu Lys Val He Pro Gly Gly Val Ser 
485 490 495 

Lys Pro Arg Lys Pro Pro Met Val Met Thr Pro Pro Thr Cys Thr 
500 505 510 

Asp His Ser Pro Ser Arg Lys Leu Pro Glu He Gin His Pro Lys 
515 520 525 

Phe Ala Ala Lys Arg Arg Trp Thr Cys Ser Lys Pro Lys Pro Ser 
530 535 540 

Thr Met Leu Arg Glu Ala Val Met Ala Thr Ser Asp Lys Leu Met 
545 550 555 

Leu Glu Pro Pro Ser Ala Tyr Pro He Thr Pro Ser Ser Pro Leu 
560 565 570 

Tyr Thr Asn Thr Asp Ser Leu Thr Val He Thr Pro Val Lys Lys 
575 580 585 

Lys Arg Gly Arg Pro Lys Lys Gin Pro Leu Leu Thr Val Glu Thr 
590 595 600 

He His Glu Gly Thr Ser Thr Ser Pro Val Ser Pro He Ser Arg 
605 610 615 

Glu Phe Pro Gly Thr Lys Lys Arg Lys Arg Arg Arg Asn Leu Ala 
620 625 630 

Lys Leu Ala Gin Leu Val Pro Gly Glu Asp Lys Pro Met Ser Glu 
635 640 645 

Met Lys Phe His Lys Lys Val Gly Lys Leu Gly Val Leu Asp Lys 
650 555 660 

Lys Thr He Lys Thr He Asn Lys Met Lys Thr Leu Lys Arg Lys 
665 670 675 

Asn He Leu Asn Gin He Leu Ser Cys Ser Ser Ser Val Ala Leu 
680 685 690 

Lys Ala Lys Ala Pro Pro Glu Thr Ser Pro Gly Ala Ala Ala He 
695 700 705 

Glu Ser Lys Leu Gly Lys Gin He Asn Val Ser Lys Arg Gly Thr 
710 715 720 

He Tyr He Gly Lys Lys Arg Gly Arg Lys Pro Arg Ala Glu Leu 
725 730 735 

Pro Pro Pro Ser Glu Glu Pro Lys Thr Ala He Lys His Pro Arg 
740 745 750 
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Pro Val Ser Ser Gin Pro Asp Val Pro Ala Val Pro Ser Asn Phe 
755 760 765 

Gin Ser Leu Val Ala Ser Ser Pro Ala Ala Met His Pro Leu Ser 
770 775 780 

Thr Gin Leu Gly Gly Ser Asn Gly Asn Leu Ser Pro Ala Ser Thr 
785 790 795 

Glu Thr Asn Phe Ser Glu Leu Lys Thr Met Pro Asn Leu Gin Pro 
800 805 810 

lie Ser Ala Leu Pro Thr Lys Thr Gin Lys Gly He His Ser Gly 
815 820 825 

Thr Trp Lys Leu Ser Pro Pro Arg Leu Met Ala Asn Ser Pro Ser 
830 835 840 

His Leu Cys Glu He Gly Ser Leu Lys Glu He Thr Leu Ser Pro 
845 850 855 

Val Ser Glu Ser His Ser Glu Glu Thr He Pro Ser Asp Ser Gly 
860 865 870 

He Gly Thr Asp Asn Asn Ser Thr Ser Asp Gin Ala Glu Lys Ser 
875 880 885 

Ser Glu Ser Arg Arg Arg Tyr Ser Phe Asp Phe Cys Ser Leu Asp 
890 895 900 

Asn Pro Glu Ala He Pro Ser Asp Thr Ser Thr Lys Asn Arg His 
905 910 915 

Gly His Arg Gin Lys His Leu He Val Asp Asn Phe Leu Ala His 
920 925 930 

Glu Ser Leu Lys Lys Pro Lys His Lys Arg Lys Arg Lys Ser Leu 
935 940 945 

Gin Asn Arg Asp Asp Leu Gin Phe Leu Ala Asp Leu Glu Glu Leu 
950 955 960 

He Thr Lys Phe Gin Val Phe Arg He Ser His Arg Ser Tyr Thr 
965 970 975 

Phe Tyr His Glu Asn Pro Tyr Pro Ser He Phe Arg He Asn Phe 
980 985 990 

Asp His Tyr Tyr Pro Val Pro Tyr He Gin Tyr Asp Pro Leu Leu 
995 1000 1005 

Tyr Leu Arg Arg Thr Ser Asp Leu Lys Ser Lys Lys Lys Arg Gly 
1010 1015 1020 

Arg Pro Ala Lys Thr Asn Asp Thr Met Thr Lys Val Pro Phe Leu 
1025 1030 1035 

Gin Gly Phe Ser Tyr Pro He Pro Ser Gly Ser Tyr Tyr Ala Pro 
1040 1045 1050 

Tyr Gly Met Pro Tyr Thr Ser Met Pro Met Met Asn Leu Gly Tyr 
1055 1060 1065 

Tyr Gly Gin Tyr Pro Ala Pro Leu Tyr Leu Ser His Thr Leu Gly 
1070 1075 1080 

Ala Ala Ser Pro Phe Met Arg Pro Thr Val Pro Pro Pro Gin Phe 
1085 1090 1095 

His Thr Asn Ser His Val Lys Met Ser Gly Ala Ala Lys His Lys 
1100 1105 1110 

Ala Lys His Gly Val His Leu Gin Gly Pro Val Ser Met Gly Leu 
1115 1120 1125 

Gly Asp Met Gin Pro Ser Leu Asn Pro Pro Lys Val Gly Ser Ala 
1130 1135 1140 

Ser Leu Ser Ser Gly Arg Leu His Lys Arg Lys His Lys His Lys 
1145 1150 1155 

His Lys His Lys Glu Asp Arg He Leu Gly Thr His Asp Asn Leu 
1160 1165 1170 

Ser Gly Leu Phe Ala Gly Lys Ala Thr Gly Phe Ser Ser His He 
1175 1180 1185 

Leu Ser Glu Arg Leu Ser Ser Ala Asp Lys Glu Leu Pro Leu Val 
1190 1195 1200 

Ser Glu Lys Asn Lys His Lys Glu Lys Gin Lys His Gin His Ser 
1205 1210 1215 

Glu Ala Gly His Lys Ala Ser Lys Asn Asn Phe Glu Val Asp Thr 
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1220 1225 1230 

Leu Ser Thr Leu Ser Leu Ser Asp Ala Gin His Trp Thr Gin Ala 
1235 1240 1245 

Lys Glu Lys Gly Asp Leu Ser Ser Glu Pro Val Asp Ser Cys Thr 
1250 1255 1260 

Lys Arg Tyr Ser Gly Ser Gly Gly Asp Gly Gly Ser Thr Arg Ser 
1265 1270 1275 

Glu Asn Leu Asp Val Phe Ser Glu Met Asn Pro Ser Asn Asp Lys 
1?80 1285 1290 

Trp Asp Ser Asp Val Ser Gly Ser Lys Arg Arg Ser Tyr Glu Gly 
1295 1300 1305 

Phe Gly Thr Tyr Arg Glu Lys Asp lie Gin Ala Phe Lys Met Asn 
1310 1315 1320 

Arg Lys Glu Arg Ser Ser Tyr Asp Ser Ser Met Ser Pro Gly Met 
1325 1330 1335 

Pro Ser Pro His Leu Lys Val Asp Gin Thr Ala Val His Ser Lys 
1340 1345 1350 

Asn Glu Gly Ser Val Pro Thr Met Met Thr Arg Lys Lys Pro Ala 
1355 1360 1365 

Ala Val Asp Ser Val Thr lie Pro Pro Ala Pro Val Leu Ser Leu 
1370 1375 1380 

Leu Ala Ala Ser Ala Ala Thr Ser Asp Ala Val Gly Ser Ser Leu 
1385 1390 1395 

Lys Lys Arg Phe Lys Arg Arg Glu lie Glu Ala He Gin Cys Glu 
1400 1405 1410 

Val Arg Lys Met Cys Asn Tyr Thr Lys He Leu Ser Thr Lys Lys 
1415 1420 1425 

Asn Leu Asp His Val Asn Lys Xle Leu Lys Ala Lys Arg Leu Gin 
1430 1435 2.i4Q 

Arg Gin Ser Lys Thr Gly Asn Asn Phe Val Lys Lys Arg Arg Gly 
1445 1450 1455 

Arg Pro Arg Lys Gin Pro Thr Gin Phe Asp Glu Asp Ser Arg Asp 
1460 1465 1470 

Gin Met Pro Val Leu Glu Lys Cys He Asp Leu Pro Ser Lys Arg 
1475 1480 1485 

Gly Gin Lys Pro Ser Leu Ser Pro Leu Val Leu Glu Pro Ala Ala 
1490 1495 1500 

Ser Gin Asp Thr He Met Ala Thr He Glu Ala Val He His Met 
1505 1510 1515 

Ala Arg Glu Ala Pro Pro Leu Pro Pro Pro Pro Pro Pro Pro Leu 
1520 1525 1530 

Pro Pro Pro Pro Pro Pro Pro Leu Pro Pro Pro Pro Pro Leu Pro 
1535 1540 1545 

Lys Thr Pro Arg Gly Gly Lys Arg Lys His Lys Pro Gin Ala Pro 
1550 1555 2.560 

Ala Gin Pro Pro Gin Gin Ser Pro Pro Gin Gin Pro Leu Pro Gin 
1565 1570 1575 

Glu Glu Glu Val Lys Ala Lys Arg Gin Arg Lys Ser Arg Gly Ser 
1580 1585 1590 

Glu Ser Glu Val Leu Pro 
1595 

<210> 19 

<211> 191 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7506232CD1 

<400> 19 

Met Glu Ala Gin Ala Gin Gly Leu Leu Glu Thr Glu Pro Leu Gin 
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-, ^ 10 15 

Gly Thr Asp Glu Asp Ala Val Ala Ser Ala Asp Phe Ser Ser Met 
20 25 30 

Leu Ser Glu Glu Glu Lys Glu Glu Leu Lys Ala Glu Leu Val Gin 
35 40 45 

Leu Glu Asp Glu He Thr Thr Leu Arg Gin Val Leu Ser Ala Lys 
5° 55 60 

Glu Arg His Leu Val Glu He Lys Gin Lys Leu Gly Met Asn Leu 
65 70 75 

Met Asn Glu Leu Lys Gin Asn Phe Ser Lys Ser Trp His Asp Met 
80 85 90 

Gin Thr Thr Thr Ala Tyr Lys Lys Thr His Glu Thr Leu Ser His 
95 100 105 

Ala Gly Gin Lys Ala Thr Ala Ala Phe Ser Asn Val Gly Thr Ala 

110 H5 
He Ser Lys Lys Phe Gly Asp Met Arg Asn Ser Pro Thr Phe Lys 

125 130 
Ser Phe Glu Glu Arg Val Glu Thr Thr Val Thr Ser Leu Lys Thr 

140 145 
Lys Val Gly Gly Thr Asn Pro Asn Gly Gly Ser Phe Glu Glu Val 
155 160 165 

Leu Ser Ser Thr Ala His Ala Ser Ala Gin Ser Leu Ala Gly Gly 
170 175 280 

Ser Arg Arg Thr Lys Glu Glu Glu Leu Gin Cys 
185 190 

<210> 20 
<211> 143 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2585358CD1 

<400> 20 

Met Arg Asp Ala Asp Ala Asp Ala Gly Gly Gly Ala Asp Gly Gly 
15 10 15 

Asp Gly Arg Gly Gly His Ser Cys Arg Gly Gly Val Asp Thr Ala 

20 25 30 

Ala Ala Pro Ala Gly Gly Ala Pro Pro Ala His Ala Pro Gly Pro 

35 40 45 

Gly Arg Asp Ala Ala Ser Ala Ala Arg Gly Ser Arg Met Arg Pro 

^0 55 60 

Hxs He Phe Thr Leu Ser Val Pro Phe Pro Thr Pro Leu Glu Ala 

65 70 75 

Glu He Ala His Gly Ser Leu Ala Pro Asp Ala Glu Pro His Gin 

80 85 90 

Arg Val Val Gly Lys Asp Leu Thr Val Ser Gly Arg He Leu Val 

95 100 105 

val Arg Trp Lys Ala Glu Asp Cys Arg Leu Leu Arg He Ser Val 
110 115 120 

He Asn Phe Leu Asp Gin Leu Ser Leu Val Val Arg Thr Met Gin 
125 130 135 

Arg Phe Gly Pro Pro Val Ser Arg 
140 

<210> 21 

<211> 1488 

<212> PRT 

<213> Homo sapiens 

<220> 
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<221> inisc_feature 

<223> Incyte ID No: 3961495CD1 

<400> 21 

Met Gin Thr Asn Gly Leu Pro Asn Trp Gly Met Ala Val Gly Met 
^5 10 15 

Gly Ala lie lie Pro Pro His Leu Gin Gly Leu Pro Gly Ala Asn 
20 25 30 

Gly Ser Ser Val Ser Gin Val Ser Gly Gly Ser Ala Glu Gly He 
35 40 45 

Ser Asn Ser Val Trp Gly Leu Ser Pro Gly Asn Pro Ala Thr Gly 
50 55 gQ 

Asn Ser Asn Ser Gly Phe Ser Gin Gly Asn Gly Asp Thr Val Asn 
65 70 75 

Ser Ala Leu Ser Ala Lys Gin Asn Gly Ser Ser Ser Ala Val Gin 
80 85 90 

Lys Glu Gly Ser Gly Gly Asn Ala Trp Asp Ser Gly Pro Pro Ala 
95 100 105 

Gly Pro Gly He Leu Ala Trp Gly Arg Gly Ser Gly Asn Asn Gly 

115 120 
Val Gly Asn He His Ser Gly Ala Trp Gly His Pro Ser Arg Ser 
125 130 135 

Thr Ser Asn Gly Val Asn Gly Glu Trp Gly Lys Pro Pro Asn Gin 

140 145 
Hxs Ser Asn Ser Asp He Asn Gly Lys Gly Ser Thr Gly Trp Glu 
155 160 165 

Ser Pro Ser Val Thr Ser Gin Asn Pro Thr Val Gin Pro Gly Gly 
I'^O 175 180 

Glu His Met Asn Ser Trp Ala Lys Ala Ala Ser Ser Gly Thr Thr 
185 190 195 

Ala Ser Glu Gly Ser Ser Asp Gly Ser Gly Asn His Asn Glu Gly 
200 205 210 

Ser Thr Gly Arg Glu Gly Thr Gly Glu Gly Arg Arg Arg Asp Lys 
215 220 225 

Gly He He Asp Gin Gly His He Gin Leu Pro Arg Asn Asp Leu 
230 235 240 

Asp Pro Arg Val Leu Ser Asn Thr Gly Trp Gly Gin Thr Pro Val 
245 250 255 

Lys Gin Asn Thr Ala Trp Glu Phe Glu Glu Ser Pro Arg Ser Glu 
260 265 270 

Arg Lys Asn Asp Asn Gly Thr Glu Ala Trp Gly Cys Ala Ala Thr 
275 280 285 

Gin Ala Ser Asn Ser Gly Gly Lys Asn Asp Gly Ser He Met Asn 
290 295 300 

Ser Thr Asn Thr Ser Ser Val Ser Gly Trp Val Asn Ala Pro Pro 
305 310 215 

Ala Ala Val Pro Ala Asn Thr Gly Trp Gly Asp Ser Asn Asn Lys 
320 325 330 

Ala Pro Ser Gly Pro Gly Val Trp Gly Asp Ser He Ser Ser Thr 

335 340 
Ala Val Ser Thr Ala Ala Ala Ala Lys Ser Gly His Ala Trp Ser 

355 360 
Gly Ala Ala Asn Gin Glu Asp Lys Ser Pro Thr Trp Gly Glu Pro 
365 370 375 

Pro Lys Pro Lys Ser Gin His Trp Gly Asp Gly Gin Arg Ser Asn 
380 385 390 

Pro Ala Trp Ser Ala Gly Gly Gly Asp Trp Ala Asp Ser Ser Ser 
395 400 405 

Val Leu Gly His Leu Gly Asp Gly Lys Lys Asn Gly Ser Gly Trp 
410 415 420 

Asp Ala Asp Ser Asn Arg Ser Gly Ser Gly Trp Asn Asp Thr Thr 
425 430 435 

Arg Ser Gly Asn Ser Gly Trp Gly Asn Ser Thr Asn Thr Lys Ala 
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440 445 450 

Asn Pro Gly Thr Asn Trp Gly Glu Thr Leu Lys Pro Gly Pro Gin 
455 460 465 

Gin Asn Trp Ala Ser Lys Pro Gin Asp Asn Asn Val Ser Asn Trp 
470 475 480 

Gly Gly Ala Ala Ser Val Lys Gin Thr Gly Thr Gly Trp lie Gly 

485 490 
Gly Pro Val Pro Val Lys Gin Lys Asp Ser Ser Glu Ala Thr Gly 
500 505 510 

Trp Glu Glu Pro Ser Pro Pro Ser lie Arg Arg Lys Met Glu lie 
515 520 525 

Asp Asp Gly Thr Ser Ala Trp Gly Asp Pro Ser Asn Tyr Asn Asn 
530 535 540 

Lys Thr Val Asn Met Trp Asp Arg Asn Asn Pro Val lie Gin Ser 
545 550 555 

Ser Thr Thr Thr Asn Thr Thr Thr Thr Thr Thr Thr Thr Thr Ser 
560 565 570 

Asn Thr Thr His Arg Val Glu Thr Pro Pro Pro His Gin Ala Gly 
575 580 585 

Thr Gin Leu Asn Arg Ser Pro Leu Leu Gly Pro Val Ser Ser Gly 
590 595 600 

Trp Gly Glu Met Pro Asn Val His Ser Lys Thr Glu Asn Ser Trp 
605 610 615 

Gly Glu Pro Ser Ser Pro Ser Thr Leu Val Asp Asn Gly Thr Ala 
620 525 630 

Ala Trp Gly Lys Pro Pro Ser Ser Gly Ser Gly Trp Gly Asp His 
635 640 645 

Pro Ala Glu Pro Pro Val Ala Phe Gly Arg Ala Gly Ala Pro Val 
650 655 660 

Ala Ala Ser Ala Leu Cys Lys Pro Ala Ser Lys Ser Met Gin Glu 
665 670 675 

Gly Trp Gly Ser Gly Gly Asp Glu Met Asn Leu Ser Thr Ser Gin 
680 685 690 

Trp Glu Asp Glu Glu Gly Asp Val Trp Asn Asn Ala Ala Ser Gin 
695 700 705 

Glu Ser Thr Ser Ser Cys Ser Ser Trp Gly Asn Ala Pro Lys Lys 
710 715 720 

Gly Leu Gin Lys Gly Met Lys Thr Ser Gly Lys Gin Asp Glu Ala 
725 730 735 

Trp lie Met Ser Arg Leu lie Lys Gin Leu Thr Asp Met Gly Phe 
740 745 750 

Pro Arg Glu Pro Ala Glu Glu Ala Leu Lys Ser Asn Asn Met Asn 
755 760 765 

Leu Asp Gin Ala Met Ser Ala Leu Leu Glu Lys Lys Val Asp Val 
770 775 780 

Asp Lys Arg Gly Leu Gly Val Thr Asp His Asn Gly Met Ala Ala 
785 790 795 

Lys Pro Leu Gly Cys Arg Pro Pro He Ser Lys Glu Ser Ser Val 
800 805 810 

Asp Arg Pro Thr Phe Leu Asp Lys Asp Gly Gly Leu Val Glu Glu 
815 820 825 

Pro Thr Pro Ser Pro Phe Leu Pro Ser Pro Ser Leu Lys Leu Pro 
830 835 840 

Leu Ser His Ser Ala Leu Pro Ser Gin Ala Leu Gly Gly He Ala 
845 850 855 

Ser Gly Leu Gly Met Gin Asn Leu Asn Ser Ser Arg Gin He Pro 
860 865 870 

Ser Gly Asn Leu Gly Met Phe Gly Asn Ser Gly Ala Ala Gin Ala 
875 880 885 

Arg Thr Met Gin Gin Pro Pro Gin Pro Pro Val Gin Pro Leu Asn 
890 895 900 

Ser Ser Gin Pro Ser Leu Arg Ala Gin Val Pro Gin Phe Leu Ser 
905 910 915 
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Pro Gin Val Gin Ala Gin Leu Leu Gin Phe Ala Ala Lys Asn lie 
920 925 930 

Gly Leu Asn Pro Ala Leu Leu Thr Ser Pro lie Asn Pro Gin His 
935 940 945 

Met Thr Met Leu Asn Gin Leu Tyr Gin Leu Gin Leu Ala Tyr Gin 
950 955 960 

Arg Leu Gin He Gin Gin Gin Met Leu Gin Ala Gin Arg Asn Val 
965 970 975 

Ser Gly Ser Met Arg Gin Gin Glu Gin Gin Val Ala Arg Thr He 
980 985 990 

Thr Asn Leu Gin Gin Gin He Gin Gin His Gin Arg Gin Leu Ala 
995 1000 1005 

Gin Ala Leu Leu Val Lys Gin Pro Pro Pro Pro Pro Pro Pro Pro 
1010 1015 1020 

His Leu Ser Leu His Pro Ser Ala Gly Lys Ser Ala Met Asp Ser 
1025 1030 1035 

Phe Pro Ser His Pro Gin Thr Pro Gly Leu Pro Asp Leu Gin Thr 
1040 1045 1050 

Lys Glu Gin Gin Ser Ser Pro Asn Thr Phe Ala Pro Tyr Pro Leu 
1055 1060 1065 

Ala Gly Leu Asn Pro Asn Met Asn Val Asn Ser Met Asp Met Thr 
1070 1075 1080 

Gly Gly Leu Ser Val Lys Asp Pro Ser Gin Ser Gin Ser Arg Leu 
1085 1090 1095 

Pro Gin Trp Thr His Pro Asn Ser Met Asp Asn Leu Pro Ser Ala 
1100 1105 1110 

Ala Ser Pro Leu Glu Gin Asn Pro Ser Lys His Gly Ala He Pro 
1115 1120 1125 

Gly Gly Leu Ser He Gly Pro Pro Gly Lys Ser Ser He Asp Asp 
1130 1135 1140 

Ser Tyr Gly Arg Tyr Asp Leu He Gin Asn Ser Glu Ser Pro Ala 
1145 1150 1155 

Ser Pro Pro Val Ala Val Pro His Ser Arg Ser Arg Ala Lys Ser 
1160 1165 1170 

Asp Ser Asp Lys He Ser Asn Gly Pro Ser He Asn Trp Pro Pro 
1175 1180 1185 

Glu Phe His Pro Gly Val Pro Trp Lys Gly Leu Gin Asn He Asp 
1190 1195 1200 

Pro Glu Asn Asp Pro Asp Val Thr Pro Gly Thr Val Pro Thr Gly 
1205 1210 1215 

Pro Thr He Asn Thr Thr He Gin Asp Val Asn Arg Tyr Leu Leu 
1220 1225 1230 

Lys Ser Gly Gly Lys Leu Ser Asp He Lys Ser Thr Trp Ser Ser 
1235 1240 1245 

Gly Pro Thr Ser His Thr Gin Ala Ser Leu Ser His Glu Leu Trp 
1250 1255 1260 

Lys Val Pro Arg Asn Ser Thr Ala Pro Thr Arg Pro Pro Pro Gly 
1265 1270 1275 

Leu Thr Asn Pro Lys Pro Ser Ser Thr Trp Gly Ala Ser Pro Leu 
1280 1285 1290 

Gly Trp Thr Ser Ser Tyr Ser Ser Gly Ser Ala Trp Ser Thr Asp 
1295 1300 1305 

Thr Ser Gly Arg Thr Ser Ser Trp Leu Val Leu Arg Asn Leu Thr 
1310 1315 1320 

Pro Gin He Asp Gly Ser Thr Leu Arg Thr Leu Cys Leu Gin His 
1325 1330 1335 

Gly Pro Leu He Thr Phe His Leu Asn Leu Thr Gin Gly Asn Ala 
1340 1345 1350 

Val Val Arg Tyr Ser Ser Lys Glu Glu Ala Ala Lys Ala Gin Lys 
1355 1360 13S5 

Ser Leu His Met Cys Val Leu Gly Asn Thr Thr He Leu Ala Glu 
1370 1375 1380 

Phe Ala Gly Glu Glu Glu Val Asn Arg Phe Leu Ala Gin Gly Gin 
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-r -"-^^^ 1390 1395 

Ala Leu Pro Pro Thr Ser Ser Trp Gin Ser Ser Ser Ala Ser Ser 

1405 1410 
Gin Pro Arg Leu Ser Ala Ala Gly Ser Ser His Gly Leu Val Arg 
1415 1420 1425 

Ser Asp Ala Gly His Trp Asn Ala Pro Cys Leu Gly Gly Lys Gly 

1435 1440 
Ser ser Glu Leu Leu Trp Gly Gly Val Pro Gin Tyr Ser Ser Ser 
-"■^^^ 1450 1455 

Leu Trp Gly Pro Pro Ser Ala Asp Asp Ser Arg Val He Gly Ser 
1^60 1465 1470 

Pro Thr Pro Leu Thr Thr Leu Leu Pro Gly Asp Leu Leu Ser Gly 

„ ^^'^^ 1480 1485' 

Glu Ser Leu 



<210> 22 

<211> 505 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> inisc_feature 

<223> Incyte ID No: 7500801CD1 

<400> 22 

Met Gly Gly Ala Gly He Leu Leu Leu Leu Leu Ala Gly Ala Glv 

10 15 

Val Val val Ala Trp Arg Pro Pro Lys Gly Lys Cys Pro Leu Arg 

20 25 30 

Cys Ser Cys Ser Lys Asp Ser Ala Leu Cys Glu Gly Ser Pro Asp 

35 40 

Leu Pro Val Ser Phe Ser Pro Thr Leu Leu Ser Leu Ser Leu Val 

Arg Thr Gly Val Thr Gin Leu Lys Ala Gly Ser Phe Leu Arg He 
r, r, 70 75 

Pro Ser Leu His Leu Leu Leu Phe Thr Ser Asn Ser Phe Ser Val 

80 g5 
He Glu Asp Asp Ala Phe Ala Gly Leu Ser His Leu Gin Tyr Leu 

Phe He Glu Asp Asn Glu He Gly Ser He Ser Lys Asn Ala III 

^■^^ 115 120 

Arg Gly Leu Arg Ser Leu Thr His Leu Ser Leu Ala Asn Asn His 

125 130 --^ 

Leu Glu Thr Leu Pro Arg Phe Leu Phe Arg Gly Leu Asp Thr Leu 

140 145 250 

Thr His Val Asp Leu Arg Gly Asn Pro Phe Gin Cys Asp Cys Arg 
T, -, -^^^ 160 165 

Val Leu Trp Leu Leu Gin Trp Met Pro Thr Val Asn Ala Ser Val 

1*^0 175 
Gly Thr Gly Ala Cys Ala Gly Pro Ala Ser Leu Ser His Met Gin 

185 190 igc 

Leu His His Leu Asp Pro Lys Thr Phe Lys Cys Arg Ala He Glu 

205 210 
Leu Ser Trp Phe Gin Thr Val Gly Glu Ser Ala Leu Ser Val Glu 
215 220 5oc: 

Pro Phe Ser Tyr Gin Gly Glu Pro His He Val Leu Ala Gin Pro 

Phe Ala Gly Arg Cys Leu He Leu Ser Trp Asp Tyr Ser Leu Gin 

250 25S 

Arg Phe Arg Pro Glu Glu Glu Leu Pro Ala Ala Ser Val Val Ser 

265 270 

Cys Lys Pro Leu Val Leu Gly Pro Ser Leu Phe Val Leu Ala Ala 
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275 280 285 

Arg Leu Trp Gly Gly Ser Gin Leu Trp Ala Arg Pro Ser Pro Gly 

290 295 300 

Leu Arg Leu Ala Pro Thr Gin Thr Leu Ala Pro Arg Arg Leu Leu 

305 310 315 

Arg Pro Asn Asp Ala Glu Leu Leu Trp Leu Glu Gly Gin Pro Cys 

320 325 330 

Phe Val Val Ala Asp Ala Ser Lys Ala Gly Ser Thr Thr Leu Leu 

335 340 345 

Cys Arg Asp Gly Pro Gly Phe Tyr Pro His Gin Ser Leu His Ala 

350 355 360 

Trp His Arg Asp Thr Asp Ala Glu Ala Leu Glu Leu Asp Gly Arg 

365 370 375 

Pro His Leu Leu Leu Ala Ser Ala Ser Gin Arg Pro Val Leu Phe 

380 385 390 

His Trp Thr Gly Gly Arg Phe Glu Arg Arg Thr Asp He Pro Glu 

395 400 405 

Ala Glu Asp Val Tyr Ala Thr Arg His Phe Gin Ala Gly Gly Asp 

410 415 420 

Val Phe Leu Cys Leu Thr Arg Tyr He Gly Asp Ser Met Val Met 

425 430 435 

Arg Trp Asp Gly Ser Met Phe Arg Leu Leu Gin Gin Leu Pro Ser 

440 445 450 

Arg Gly Ala His Gly Pro Ser Leu Leu Asp Leu Pro Ser Cys Pro 

455 460 465 

Ser Gly Leu Lys Gin His Leu Asp He Ser Ala Gly Ala Asp Ser 

470 475 480 

Pro Ser Lys Ala Thr Trp Gly Val Gin Ala Gly Ala Leu His Gly 

485 490 495 

Val Glu Val Glu Val Thr Cys Arg Ala Pro 

500 505 

<210> 23 

<211> 1544 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 7506414CD1 

<400> 23 

:.YS Cys He Pro Leu Trp Arg Cys Asn Arg His Val Glu 

10 15 
s Ser Leu Gin Val Val Pro Glu Glu 
25 30 
r Leu Glu Glu Leu Leu Leu Asp Ala 
40 45 

0 Lys Pro Phe Phe Arg Leu Leu Asn 
55 60 

r Asp Asn Glu He Gin Arg Leu Pro 
70 75 
t Gin Leu Val Glu Leu Asp Val Ser 
85 90 
s Pro Glu Ser He Lys Phe Cys Lys 
100 105 

1 Ser Gly Asn Pro Leu Ser Arg Leu 
115 120 

I Arg Ser Leu Ala His Leu Ala Leu 
130 135 
I Leu Pro Gly Asp Val Gly Asn Leu 
145 150 



Met Leu Lys 


Cys 


He Pro 


1 




5 


Ser Val Asp 


Lys 


Arg His 






20 


He Tyr Arg 


Tyr 


Ser Arg 






35 


Asn Gin Leu 


Arg 


Glu Leu 






50 


Leu Arg Lys 


Leu Gly Leu 






65 


Pro Glu Val 


Ala 


Asn Phe 






80 


Arg Asn Asp 


He 


Pro Glu 






95 


Ala Leu Glu 


He 


Ala Asp 






110 


Pro Asp Gly 


Phe 


Thr Gin 






125 


Asn Asp Val 


Ser 


Leu Gin 






140 


Ala Asn Leu 


Val 


Thr Leu 
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155 160 165 

Leu Pro Ala Ser Leu Ser Phe Leu Val Lys Leu Glu Gin Leu Asp 
170 175 180 

Leu Gly Gly Asn Asp Leu Glu Val Leu Pro Asp Thr Leu Gly Ala 
185 190 195 

Leu Pro Asn Leu Arg Glu Leu Trp Leu Asp Arg Asn Gin Leu Ser 
200 205 210 

Ala Leu Pro Pro Glu Leu Gly Asn Leu Arg Arg Leu Val Cys Leu 
215 220 225 

Asp Val Ser Glu Asn Arg Leu Glu Glu Leu Pro Ala Glu Leu Gly 
230 235 240 

Gly Leu Val Leu Leu Thr Asp Leu Leu Leu Ser Gin Asn Leu Leu 
245 250 • 255 

Arg Arg Leu Pro Asp Gly He Gly Gin Leu Lys Gin Leu Ser He 
260 265 270 

Leu Lys Val Asp Gin Asn Arg Leu Cys Glu Val Thr Glu Ala He 
275 280 285 

Gly Asp Cys Glu Asn Leu Ser Glu Leu He Leu Thr Glu Asn Leu 
290 295 300 

Leu Met Ala Leu Pro Arg Ser Leu Gly Lys Leu Thr Lys Leu Thr 
305 310 315 

Asn Leu Asn Val Asp Arg Asn His Leu Glu Ala Leu Pro Pro Glu 
320 325 330 

He Gly Gly Cys Val Ala Leu Ser Val Leu Ser Leu Arg Asp Asn 
335 340 345 

Arg Leu Ala Val Leu Pro Pro Glu Leu Ala His Thr Thr Glu Leu 

350 355 
His Val Leu Asp Val Ala Gly Asn Arg Leu Gin Ser Leu Pro Phe 
365 370 375 

Ala Leu Thr His Leu Asn Leu Lys Ala Leu Trp Leu Ala Glu Asn 
380 385 390 

Gin Ala Gin Pro Met Leu Arg Phe Gin Thr Glu Asp Asp Ala Arg 
395 400 405 

Thr Gly Glu Lys Val Leu Thr Cys Tyr Leu Leu Pro Gin Gin Pro 
410 415 420 

Pro Leu Ser Leu Glu Asp Ala Gly Gin Gin Gly Ser Leu Ser Glu 
425 430 435 

Thr Trp Ser Asp Ala Pro Pro Ser Arg Val Ser Val He Gin Phe 
440 445 450 

Leu Glu Ala Pro He Gly Asp Glu Asp Ala Glu Glu Ala Ala Ala 
455 460 465 

Glu Lys Arg Gly Leu Gin Arg Arg Ala Thr Pro His Pro Ser Glu 
470 475 480 

Leu Lys Val Met Lys Arg Ser He Glu Gly Arg Arg Ser Glu Ala 
485 490 495 

Cys Pro Cys Gin Pro Asp Ser Gly Ser Pro Leu Pro Ala Glu Glu 
500 505 510 

Glu Lys Arg Leu Ser Ala Glu Ser Gly Leu Ser Glu Asp Ser Arg 
515 520 525 

Pro Ser Ala Ser Thr Val Ser Glu Ala Glu Pro Glu Gly Pro Ser 
530 535 540 

Ala Glu Ala Gin Gly Gly Ser Gin Gin Glu Ala Thr Thr Ala Gly 
545 550 555 

Gly Glu Glu Asp Ala Glu Glu Asp Tyr Gin Glu Pro Thr Val His 
560 565 570 

Phe Ala Glu Asp Ala Leu Leu Pro Gly Asp Asp Arg Glu He Glu 
575 580 585 

Glu Gly Gin Pro Glu Ala Pro Trp Thr Leu Pro Gly Gly Arg Gin 
590 595 600 

Arg Leu He Arg Lys Asp Thr Pro His Tyr Lys Lys His Phe Lys 
605 610 615 

He Ser Lys Leu Pro Gin Pro Glu Ala Val Val Ala Leu Leu Gin 
620 625 630 
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Gly 


Met 


Gin 


Pro 


Asp 


Gly 


Glu 


Gly 


Pro 


Val 


Ala 


Pro Gly 


Gly Trp 










635 










640 






645 


His 


Asn 


Gly 


Pro 


His 


Ala 


Pro 


Trp 


Ala 


Pro 


Arg 


Ala Gin 


Lys Glu 










650 










655 






660 


Glu 


Glu 


Glu 


Glu 


Glu 


Glu 


Gly 


Ser 


Pro 


Gin 


Glu 


Glu Glu 


Glu Glu 










665 










570 






675 


Glu 


Glu 


Glu 


Glu 


Asn 


Arg 


Ala 


Glu 


Glu 


Glu 


Glu 


Ala Ser 


Thr Glu 










680 










685 






690 


Glu 


Glu 


Asp 


Lys 


Glu 


Gly 


Ala 


Val 


Val 


Ser 


Ala 


Pro Ser 


Val Lys 










695 










700 






705 


Gly 


Val 


Ser 


Phe 


Asp 


Gin 


Ala 


Asn 


Asn 


Leu 


Leu 


He Glu 


Pro Ala 










710 










715 






720 


Arg 


lie 


Glu 


Glu 


Glu 


Glu 


Leu 


Thr 


Leu 


Thr 


He 


Leu Arg 


Gin Thr 










725 










730 






735 


Gly 


Gly 


Leu 


Gly 


He 


Ser 


He 


Ala 


Gly 


Gly 


Lys 


Gly Ser 


Thr Pro 










740 










745 






750 


Tyr 


Lys 


Gly 


Asp 


Asp 


Glu 


Gly 


He 


Phe 


He 


Ser 


Arg Val 


Ser Glu 










755 










750 






765 


Glu 


Gly 


Pro 


Ala 


Ala 


Arg 


Ala 


Gly 


Val 


Arg 


Val 


Gly Asp 


Lys Leu 










770 










775 






780 


Leu 


Glu 


Val 


Asn 


Gly 


Val 


Ala 


Leu 


Gin 


Gly 


Ala 


Glu His 


His Glu 










785 










790 






795 


Ala 


Val 


Glu 


Ala 


Leu 


Arg 


Gly 


Ala 


Gly 


Thr 


Ala 


Val Gin 


Met Arg 










800 










805 






810 


Val 


Trp 


Arg 


Glu 


Arg 


Met 


Val 


Glu 


Pro 


Glu 


Asn 


Ala Val 


Thr He 










815 










820 






825 


Thr 


Pro 


Leu 


Arg 


Pro 


Glu 


Asp 


Asp 


Tyr 


Ser 


Pro 


Arg Glu 


Arg Arg 










830 










835 






840 


Gly 


Gly 


Gly 


Leu 


Arg 


Leu 


Pro 


Leu 


Leu 


Pro 


Pro 


Glu Ser 


Pro Gly 










845 










850 






855 


Pro 


Leu 


Arg 


Gin 


Arg 


His 


Val 


Ala 


Cys 


Leu 


Ala 


Arg Ser 


Glu Arg 










860 










865 






870 


Gly 


Leu 


Gly 


Phe 


Ser 


He 


Ala 


Gly 


Gly 


Lys 


Gly 


Ser Thr 


Pro Tyr 










875 










880 






885 


Arg 


Ala 


Gly 


Asp 


Ala 


Gly 


He 


Phe 


Val 


Ser 


Arg 


He Ala 


Glu Gly 










890 










895 






900 


Gly 


Ala 


Ala 


His 


Arg 


Ala 


Gly 


Thr 


Leu 


Gin 


Val 


Gly Asp 


Arg Val 










905 










910 






915 


Leu 


Ser 


Glu 


He 


Arg 


Leu 


Pro 


Arg 


Ala 


Gly 


Gly 


Pro Leu 


Gly Leu 










920 










925 






930 


Ser 


He 


Val 


Gly 


Gly 


Ser 


Asp 


His 


Ser 


Ser 


His 


Pro Phe 


Gly Val 










935 










940 






945 


Gin 


Glu 


Pro 


Gly 


Val 


Phe 


He 


Ser 


Lys 


Val 


Leu 


Pro Arg 


Gly Leu 










950 










955 






960 


Ala 


Ala 


Arg 


Ser 


Gly 


Leu 


Arg 


Val 


Gly 


Asp 


Arg 


He Leu 


Ala Val 










965 










970 






975 


Asn 


Gly 


Gin 


Asp 


Val 


Arg 


Asp 


Ala 


Thr 


His 


Gin 


Glu Ala 


Val Ser 










980 










985 






990 


Ala 


Leu 


Leu 


Arg 


Pro 


Cys 


Leu 


Glu 


Leu 


Ser 


Leu 


Leu Val 


Arg Arg 










995 








1000 






1005 


Asp 


Pro 


Ala 


Pro 


Pro 


Gly 


Leu 


Arg 


Glu 


Leu 


Cys 


He Gin 


Lys Ala 








1010 








1015 






1020 


Pro 


Gly 


Glu 


Arg 


Leu 


Gly 


He 


Ser 


He 


Arg 


Gly 


Gly Ala 


Arg Gly 








1025 








1030 






1035 


His 


Ala 


Gly 


Asn 


Pro 


Arg 


Asp 


Pro 


Thr 


Asp 


Glu 


Gly He 


Phe He 








1040 








1045 






1050 


Ser 


Lys 


Val 


Ser 


Pro 


Thr 


Gly 


Ala 


Ala Gly 


Arg 


Asp Gly 


Arg Leu 








1055 








1060 






1065 


Arg 


Val 


Gly 


Leu Arg 


Leu 


Leu 


Glu 


Val 


Asn 


Gin 


Gin Ser 


Leu Leu 








1070 








1075 






1080 


Gly 


Leu 


Thr 


His 


Gly 


Glu 


Ala 


Val 


Gin 


Leu 


Leu 


Arg Ser 


Val Gly 








1085 








1090 






1095 


Asp 


Thr 


Leu 


Thr 


Val 


Leu 


Val 


Cys 


Asp Gly 


Phe 


Glu Ala 


Ser Thr 
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1100 



ASP Ala Ala Leu^Glu Val Ser Pro Gly'val' He Ma Asn Pro'phS 

Ala Ala Oly Ile^Gly His Arg Asn sJ^ oiu Ser Xle Server 

Xle ASP Arg Glu^Leu Ser Pro Glu Gly'p^^ Gly .ys oiu ^ys'lil 

Leu pro Gly Gln^Thr Leu His Trp Gly'pro Glu Ala Thr Glu'S 

Ala Glv Ar. Cly^.eu Gin Pro .eu .ys'll^ Asp Tyr Ar. Ala'S 

Ala Ala val Pro^Ser Ala Gly Ser V.l'oln Ar. Val Pro Ser^Gl'y 

Ala Ala Gly Gly^Lys Met Ala Glu Ser'p^^ Cys Ser Pro Ser^Gl^^ 

Gin Gin Pro Pro^Ser Pro Pro Ser Pro'L'" Glu Leu Pro Ala'S 

val Lys Gin Ala^Tyr Arg Ala Phe Ala'L'f val Pro Thr Ser'SL" 

Pro Pro Glu ASP Ma Pro Ala Gin Pro'^J^ .hr Pro Gly Pro'lll 

Ala ser Pro Glu^Gla Leu Ser Phe Ar^Glu Ar. Gin Lys Tyr'p^° 

Glu Leu Glu Val^Ar. Val Pro Gin Ala'^IS Gly Pro Pro Lys'S 

val ser Leu Val^Gly Ma Asp Asp Leu'^| Lys Met Gin Glu^Glu 

Glu Ma AT. Lys^Leu Gin Gin Lys Arg'lJa^ Gin Met Leu Arg'^?^ 

Ala Ma Glu Ma^Gly Ma Glu Ma Arg'^^^ Ala Leu Asp Gly"l'u 

Thr Leu Gly Glu^Glu Glu Gin Glu Asp'^^" Gin Pro Pro .r,'lll 

ser Pro Ser Pro Thr Ser Arg Gin Ser'p^o Ma Ser Pro Pro'pro 
^■^^^ 1360 



L» Oly a,y Gly^M. p„ v.l Thr X x,ys Ma =1„ ^'l^l 



His Oln Glu Arg^wu Arg v.l 01„ s^r Pro Glu Pro Pro il."°° 
Cl» Arg Al. .eu^ser Pro Al. Glu ^'Hl a1. .eu Glu Ala'L'^ 
^y. Arg Ala .eujr. Arg Ala Ala Arg'S .y. s.r !,.„ Glu'^J° 
Al. ^ Arg «. Glu «.t v.l I^u'"? Arg Ser Glu Glu"lJ 
Arg Oly ^ Arg^Gly Pro Leu Glu Arg'S Ala Glu Ala Pro's" 

Pro Ala Pro Thr Pro Ser Pro Thr Pr-n^tli ni . ^^^^ 
1460 " Pro 

Gin Thr ser Xhr^ser Pro Gly Arg Leu HI Pro Asp Phe Ma'J!° 

Glu Leu Arg Ser^Leu Glu Pro Ser Pro tli Pro Gly Pro Gln'^f^ 

Glu ASP Gly Glujal Ma Leu Val Leu'Z^J Gly Arg Pro Ser1°o' 

Gly Ala val Gly^Pro Glu Asp Val Ma^iS Cys Ser Ser Arg"^^ 

Pro val Arg Pro Gly Arg Arg Gly Leu'S Pro Val Pro Ser'"° 

<210> 24 
<211> 1609 
<212> PRT 
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<213> Homo sapiens 
<220> 

<221> inisc_f eature 

<223> Incyte ID No: 7506415CD1 

<400> 24 

Met I,e„ ^y. cy, xie P.O cys ^„ 

V.1 MP I,.. ^ HI, CVS S„ .e„ ol° val „.l p„ „i„ 

^ s„ AT, =er I,e„ .1^= ^ ^0 

.!„ cl„ ^ ^45 

oa. .e. s,. ..p M„ «p ,1. ^ ^» 

ai„ VaZ «a ^„ Met 0I„ v 01„ ^ ^.1 

Arg Asn Asp lie Pro 61u lie Pro n,, all -r, 9° 

u j.xe Pro Glu Ser lie Lys Phe Cys Lys 

-a «u XI. P.e a„ 01. ,„ s„ ^, 

..p 01, ^ ^ ^ 

^ ASP V.1 s„ 01. Ma ^ P„ |J 

Ala Aa„ .au Val Tte ol„ !,,„ ^„ ^„ ^JO 

Pro M. sa, «j .a. «.a .a„ v.l £° 01„ 01„ .a„ III 
01. 01. ^p ^„ ^ UO 

LBU Pro Asn Leu Aro Glu Lsu t„ . 195 
200 ^ *K S« 

Ala .eu Pro P,o Olu ol. Aa„ .e„ A^ Ar. .au Val c.a S 
ASP val sar Olu Asn Ar. .i„ „^ 
Oly .au val |= ^ 240 

AT. Arg L„ P„ ASP 01. n. 01. 01„ !Z ..a ol„ ^ 3er ^l'^ 
:..u ..a v.1 A.P 01„ A» Ar, .eu Ca III val Thr olu Ala III 
«1. ASP C.a Olu Aa„ l,a„ Ser Olu ^ lH ^ ^„ ^ 
.eu Met Ala .au «o Ar. Ser Leu 01. S .au ..a .au X 
^ ^ Aan v.1 Aap Ar, Aau His .au | a1. p„ p„ 
lie 01. 01. c val Ala .eu Ser Val sar .au Ar, Aap III 
Aro .au Ala Val p„ p„ ^ ^ ^ 145 

Hla V.1 .au Aap V.l Al. 01. Asn Ar, ol„ Ser .eu P„ Z 
AU .eu Thr Hla ^ Aan Lau .y, a1. 

31n Al. 01„,P„ !,.„ A., P., oin ^0 

-r 01. Olu .y. V.1 .a„ r^cy.^ ^™ p„ J- 

^■'•^ 420 



PCT/US02/39133 
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pro .eu S„ oiu ..p M, 01, 

..P S„ »3p p„ p„ S„ V.1 oi„ S 

I.™ Olu p„ «« 

.1, ^ 4« 

L.U .ys val „et .y. s„ Xle 0X„ S ^r. ^ 3., Glu 

cy= Pro c„ Gl„ s.r .1, .er JL" .eu Pro 

Glu Ly, sr, m s„ ^j,^ =05 510 

515 rr" ™ Ser Arg 

Pro Ser Ala Ser Thr Val Ser Pl.i ai= „ 525 

530 " '*<' Oly Pro Ser 

M. Glu Ala Gl„ Glj Gl. Ser Gl„ Gin III .1. 

Oly Glu Glu ASP Ala Glu Glu Asp ^ |^ gIu P„ ,.1 =f= 

Ph. «. Glu A.P Ala ^ p,„ |« =,„ 
-U Cly Gl„ P.P ^, -0 

Arg Leu lie Arg Lys Asp Thr Pro His Tvr Tv« t 

605 i^f^ -^^^ His Phe Lys 

Xle ser p„ „^ | 

=ly Ket Gl„ P„ ASP Gly Glu Gly P.p Al, P.o Gly Gly ^ 

Hi. A„ Gly P„ HI. ,1, p„ ^ =40 

«u Glu Glu Glu Glu Glu Gly Ser Pro f Glu Glu Glu Glu 'ol° 
«lu Glu Glu Glu A.„ AT, Ala Glu Glu Glu Ala Ser ^ S 
=lu Glu A.P .y. Glu Gly Ala V.l Val | = Ala P,o S„ Val l^i 
.eu 1.U ^ ,1. ^ ^ ^ ™ ^ ,05 

ne Ala Gly Gly .ye Gly Ser ^ ^ ^0 
=1. XI. P.. .1. s„ AT, val ser Glu |2 Gly P„ Ala Ala S 
-a Gly val Ar. v.l Gly Aap l.y= ^u ^ Glu Val A.U Gly III 

.eu Glu Gly Ala Glu HI. HI. Glu l a" Val Glu Ala x.u 
Oly Ala Gly Ala Val Gin „ot Ar, ill Trp Ar, Glu Ar, 
V.1 Glu Pro Glu A» Al. val p„ ^ 7.5 

ASP A.P s.. P„ Ar, Glu AT, AT. || Gly Gly ^ Ar» 

Pro Leu Leu Pro Pro Glu Ser Pro n„ ^ . 825 

830 ° ^3 Gin Arg His 

Val Ala Cys Leu Ala Arg Ser Glu Attt pi t , ^40 

845 ^^"^ Phe Ser He 

Ala Gly Gly Lys Gly Ser Thr Pro T 855 

860 ^° Asp Ala Gly 

He Phe val Ser Arg xle Ala Glu Gly GlJ Ala Ala Hi. . 

875 ^ ati Ala Hxs Arg Ala 

Gly Thr Leu Gin. Val Gly Asd Ar-rr ^r^i r ^ 885 
■L i^iy Asp Arg Val Leu Ser He Asn Gly val 
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ASP V.1 Th. =1„ AT, Hi, His V.l S„ !,„ ^ 

Ala .1. ... p„ ^ «J ,15 

Gly pro !,» pro Pro s,r Pro Pro Ss Ser Ser Pro Pro Z 
Al. V.1 Al, Thr «,r S,r II. Thr Thr Al, p„ ^« 

Pro Gly p„ s.r p„ ..r S „. Ma M. S 

Olu «y pro Tyr Pro Val «1„ «1„ XI. ^ L.u Pro .r. Al. ^ 
=ly Pro .ly ser II. val sly ser Asp „i, |f ° 
His Pro PK. Gl^^v.i ^^^1005 

I..U Pro Ar, Sly^j.„ Al. Ala Ar, S^Z L.„ Ar, v.l .ly'L^° 

n. Al.^,,j A.„ Oly .!„ Asp'^I? Ar, Asp Al. 

01. Olu Ala v.l^s.r Ala ^ at/S^^ Cys si. ^'11° 
^ ..u v.1 Ar,^Ar, ASP Pro Ala P„"|^ .ly ... Ar, olu'lll 
Cy. II. Gl„ Lys^Ala Pro Sly 01. sly H. s.r lleTg 

Oly Oly Al, A,.^siy His Al. Sly Ar, Asp Pro r^Zl 

Gl. Sly 11. P.e^ii. 3.r .y. v.l S„'J°= sly Al, Ala^^JJ 

ASP Sly Arg^.„ Arg Val Sly ^JIH 
Oln Sl„ s.r ..u^j... sly ^ h^'J^ ^^^IJ^O 
L.U AT, s,r v.l^siy Asp Thr ... Thr"L° I... v.l Cys Asp"" 
Ph. Sl„ Al. s.r^™r Asp Al. Ala v.l s.r Pro Sly"li° 

ne Al. AS. Pro^g, Ala Al. Sly Il.'^f" His Ar, As. sj^ 
a. s.r II. ser J„ n. Asp Ar, slu'"= a» Pro Sl„ sly'pj" 
=ly I.ys SI. i,^^sl. ^ p„ Sly Olu Si i,. Hi. Trp Sly'^^ 
Qlu Al. ^ slu^Al. Al. Sly AT, Sly'2= si„ Pr. i,^^^^ 
ASP .yr AT, Al.^.e. Al. Al. v.l Pro^K^ Ala sly Ser V.l^J^ 
ATS val P„ ser^sly Al. Ala Sly sly'S Met Al. Sl„ Seriro" 
cy, ser Pro s.r^sly SI. sl„ Pro Prol™ Pro Pro Ser Pro'S 
=1U L«. Pro A1.^A.„ v.1 I,y. Sl„ A1,'?S Ar, Al. Ph. Ala'L'." 

Val Pro Thr Ser His Pro Pr-r. pi„ ,^ 1305 
_^His Pro Pro Glu Asp Ala Pro Ala Gin Pro Pro 

^ pro Oly Pro^Al, Al, Ser Pro OI.'S I,., ser PHe Ar/S 
AT, om Ly. ly^^pj. SI. I... slu Val'^ val Pro 01. Al.'S' 
Cly Pro Pro .ys^AT, V.l s.r w v.l sJJ Al. Asp Asp i,e„'™ 

1360 2365 
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Lys Met Gin Glu^Glu Glu Ala Arg Lys Leu Gin Gin Lys Arg Ala 

Gin Met .eu Arg^Glu Ala Ala Glu Ala"Iy Ala Glu Ala Arg"^° 

Ala Leu ASP Gly^Glu Thr Leu Gly Glu"!u Glu Gin Glu Asp"!u 

cm Pro P.O Trp Ala Ser Pro Ser Pro'Sr Ser Arg Gin sJtll 

-L<liD 1420 
Ala ser Pro Pro^Pro Leu Gly Gly Gly Ala Pro Val Arg Thr^Ala 

Lys Ala Glu Arg^Arg His Gin Glu Arg'S Arg Val Gin Ser'pJo 

Glu Pro Pro Ala^Pro Glu Arg Ala LeuleJ Pro Ala Glu Leu'S 

Ala Leu Glu Ala^Glu Lys Arg Ala Leu"^ Arg Ala Ala Arg'^I? 

Ly. ser Leu Glu Gin Asp Ala Leu Arg Ma Gin Met Val Leulfr 

Arg ser Gin Glu^Gly Arg Gly Thr Arg Gly Pro Leu Glu Arg^eu 

Ala Glu Ala Pro^Ser Pro Ala Pro Thriro Ser Pro Thr Pro'val 

Glu Asp Leu Gly Pro Gin Thr Ser Thr'ser Pro Gly Arg Leu'ser 

pro ASP Phe Ala^Glu Glu Leu Arg Ser^ Leu Glu Pro Ser Prol^^ 



Pro Gly Pro Gin Glu Glu Asp Gly Glu'Jal Ala Leu Val Leu^LeS 



1565 



Gly Arg Pro Ser'pro Gly Ala Val Glyl^o Glu Asp Val Ala^Leu 
Cys ser Ser Arg^Arg Pro Val Arg Pro1!J Arg Arg Oly Leu^Gl'y 
Pro Val Pro Ser 1605 



<210> 25 

<211> 895 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 72192179CD1 



<400> 25 

Met Lys Lys Glu Val Leu Gin Ser Ser Arg Asp lie Met Gin Ser 

Lys ser Ala Cys Glu He Lys Gin Ser Hx's Gin Glu Cys Ser Thr 

Gin Gin Thr Gin Gin Lys Lys Tyr Leu Glu Gin Leu His Leu Pro 

Gin ser Lys Pro lie Ser Pro Asn Phe Lys Val Lys Thr He £ 

Leu Pro Thr Leu Asp His Thr Leu Asn Glu Thr Asp His Ser Tyl 

Glu ser His Lys Gin Gin Ser Glu He Asp Val Gin Thr Phe Thr 

Lys Lys Gin Tyr Leu Lys Thr Lys Lys tS Glu Ala Ser Thr Glu 

cys ser His Lys Gin Ser Leu Ala Glu Arg His Tyr Gin Leu Irl 

Lys Lys Glu Lys Arg Val Thr Val Gin HI Pro Thr Glu Ser He 

130 
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Gin LYs Asn Gin Glu Asp Lys Leu Lys Met Val Pro Arg Lys Gin 
Gly 

155 160 1 

Glu Lys Asn Gin Gly Pro Ser Met lie Gly Arg Lys Glu Glu Arg 

175 1 on 

Leu He Thr Glu Arg Lys His Glu His Leu Lys Asn Lys Ser III 



Arg Glu Phe Ser Gly Ser Asp Arg Gly tyl Leu Pro Gly Ser Glu 
155 260 



185 190 

Ser 

Gin Thr Gin Asn Phe Gin Gin Thr Gin lie Gin Thr Ala Glu Ser 



Pro Lys val Val Lys Gin Lys Val He Asp Ala His Leu Asp Ser 
200 205 



Lys Ala Glu His Lys Lys Leu Pro Gin Pro Tyr Asn Ser Leu G^n 
Glu Glu Lys Cys Leu Glu Val Lys Gly III Gin Glu Lys Gin HI 



Phe ser Asn Thr Lys Asp Ser Lys Gin G^u He Thr Gin Asn Lys 



260 265 



ser Phe Phe Ser Ser Val Lys Glu Ser ^l" Arg Asp Asp Gly Lys 



275 



Gly Ala Leu Asn lie Val Glu Phe Leu Irg Lys Arg Glu Glu Leu 



290 



Gin Gin He Leu Ser Arg Val Lys Gin Phe Glu Ala Glu Pro Asn 
Lys ser Gly Leu Lys Thr Phe Gin Thr Leu Leu Asn Thr He Pro 



320 



Gly Trp Leu He Ser Glu Asp Lys Arg Glu Tyr Ala Val His He 

340 T/tc 
Ala Met Glu Asn Asn Leu Glu Lys Val Lys Glu Glu He Thr His 

360 



330 



350 



He Lys Thr Gin Ala Glu Asp Met Leu III Ser Tyr Glu Asn H 



- ^jrj. vja.u. ABU ±le 

365 370 



He Gin Thr Ala Met Met Ser Ser Lys Thr Gly Lys Pro Gly III 



385 



Lys Pro Thr Ser Leu Asp Glu Thr Ser S^^ Lys Val Ser Asn Val 



395 



400 



His Val Ser Asn Asn Lys Asn Ser Glu Gin Lys Glu Asn Lys He 



410 



415 



Ala Lys Glu Lys Thr Val Gin His Gin vtl Ala Ala His His G^u 



425 



430 



Ala Thr val Arg Ser His Val Lys Thr His Gin Glu He Lys Leu 



440 



445 



Asp ASP Ser Asn He Pro Pro Pro Ser L^u Lys Thr Arg Pro Pro 



455 



460 



ser Pro Thr Phe He Thr He Glu Ser ^to Ala Arg Arg Thr Gl'u 



470 



475 



Asn Pro Thr Lys Asn Glu Leu Ser Gin ^e^ Pro Lys Lys Asp Ser 

490 /toe 
Tyr val Glu Pro Pro Pro Arg Arg Pro Met Ser Gin Lys Ser Glu 

505 cm 
He His Arg Ala Asn Thr Ser Pro Ser Pro Pro Arg Ser Arg Ser 

520 coc 
Glu Gin Leu Val Arg Leu Lys Asp Thr Thr Ala Lys Leu Ser Lys 

Gly Ala He Pro Cys Pro Ala Ala Thr Pro Val Pro He Val gS 

550 

Lys Arg Ser Glu He He Met Ser Pro Ala Thr Leu Arg Arg Gil 

He Lys He Glu Thr Arg Gly Arg Asp III Pro Pro Thr He 

o /o 580 
He Pro val Asn He Asn His Ala Ala Ser Gly Ser Phe Arg Gl'u 

595 cnn 
Ser val Asp Ala Gin Glu Glu He Arg Lys Val Glu Lys Arg 111 
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605 



Thr Tyr Val His Lys Asp Gly Leu Asn ^e^ Thr Asp His Met vll 



Pro ASP Thr Glu Ser Tyr Asp Ala Val Glu He He Arg Lys vll 



630 



635 



Ala val Pro Pro Arg Leu Ser Glu His Ite Gin Arg Tyr Glu Ma 



650 



Ala Asn Arg Thr Val Gin Met Ala Glu III Phe Val Asn Asp Pro 



665 



Glu Asn Glu He Asn Arg Trp Phe Arg gIu Phe Glu His Gly III 



680 



val ser Glu Ala Lys Ser Asn Arg Arg vll Tyr Ala Lys Gly Gl'u 



690 



695 



Thr Asn His Asn lie Gin Gin Glu Ser IJg Thr Phe Cys Lys ITu 



710 



Glu Phe Gly Leu Thr Ser Leu Gly Asn Ser Phe Thr Asp Phe 



725 



ser cys Lys His Pro Arg Glu Leu Arg S Lys He Pro Val Lys 



740 



Gin Pro Arg He Cys Ser Glu Thr Arg S^r Leu Ser Glu His Phe 



750 



ser Gly Met Asp Ala Phe Glu Ser Gin He Val Glu Ser Lys Met 

770 



1P , 760 765 

Met 

Lys Thr ser Ser Ser His Ser Ser Glu Al^ Gly Lys Ser Gly 
ASP Phe Lys His Ala Pro Pro Thr Tyr Gl'u Asp Val He Ala Gl'y 
His Xle Leu ASP Xle Ser Asp Ser Pro L^s Glu Val Arg Lys As'n 



790 795 
-ra-v. rj.u xar uyr Glu Asp Val He Ala 
805 

Ser Asp Ser Pro Lys Glu Val Arg Lys 
o±o 820 
Phe Gin Lys Thr Trp Gin Glu Ser Gly Arg Val Phe Lys Gly £u 

Gly Tyr Ala Thr Ala Asp Ala Ser Ala tH Glu Met Arg Thr Z 

845 850 

^^j-a jTue j_j.e tje>r dlti al= tm -. im _ tx 

860 



Phe Gin Glu Glu ser Ala Phe He Ser Glu Ala Ala Ala Pro S 
Gin Gly Asn Met Tyr Thr Leu Ser Lys Isp Ser Leu Ser Asn cly 



Val Pro Ser Gly Arg Gin Ala Glu Phe Ser 



880 885 



890 895 



<210> 26 

<211> 109 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> inisc_feature 

<223> Incyte ID No: 7505908CD1 

<400> 26 

Met Glu ser Asp Phe Tyr Leu Arg Tyr Tyr Val Gly His Lys Gly 
Lys Phe Gly His Glu Phe Leu Glu Phe Glu Phe Arg Pro Asp oly 
Lys Leu Arg Tyr Ala Asn Asn Ser Asn tJ^ Lys Asn Asp Val mi 

-rn 40 JIC- 

He Arg Lys Glu Glu Leu Glu He Val He Gly Asp Glu His xtl 
ser Phe Thr Thr Ser Lys He Gly Ser lJu He Asp Val Asn Gl'n 

Ser Lys Asp Pro Glu Gly Leu Arg Val Phe Tyr Tyr Leu Val cln 

85 

Asp Leu Lys cys Leu Val Phe Ser Leu He Gly Leu His Phe Lys 
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He Lys Pro lie 105 

<210> 27 
<211> 334 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> inisc_feature 

<223> Incyte ID No: 6590147CD1 

<400> 27 

Met Lys Phe Pro Gly Pro Leu Glu Asn Gin Arg Leu Ser Phe Leu 

Leu Glu Lys Ala He Thr Arg Glu Ala Gin Met Trp Lys Val As'n 

" 25 
val Arg Lys Met Pro Ser Asn Gin Ala His Pro Lys Tyr Leu Ser 

Cys He Ala He Ser Cys Phe Phe Leu Ala Ala Lys Thr Val Gl'u 

Glu ASP Glu Arg He Pro Val Leu Lys Val' Leu Ala Arg Asp slr 

Phe cys Gly Cys Ser Ser Ser Glu He Leu Axg Met Glu Arg lie 

He Leu ASP Lye Leu Asn Trp Asp Leu hJs Thr Ala Thr Pro Leu 

1 Oft 

ASP Phe Leu His He Phe His Ala He Ala Val Ser Thr Arg Pro 



Gin Leu Leu Phe Ser Leu Pro Lys Leu Ser Pro Ser Gin His llu 



125 



Ala val Leu Thr Lys Gin Leu Leu His ^ Met Ala Cys Asn G^n 



140 



Leu Leu Gin Phe Arg Gly Ser Met Leu Al'a Leu Ala Met Val Ser 
Leu Glu Met Glu Lys Leu He Pro Asp Leu Ser Leu ^ HI 



170 



Glu Leu Leu Gin Lys Ala Gin Met Asp Ser Ser Gin Leu He Ji 



180 



-■oc ^'-^^ J-J-e HIS 

lab 190 



cys Arg Glu Leu Val Ala His His Leu Ser Thr Leu Gin Ser Ser 



— 205 
Leu Pro Leu Asn Ser Val Tyr Val TVr ar-r, t r ^"^^ 
51 c ^ ^9 Leu Lys His Thr 



215 



Leu val O^r Cys Asp Lys Gly Val Phe S Leu His Pro Ser Ser 



230 



val Pro Gly Pro Asp Phe Ser Lys Asp S Ser Lys Pro Glu HI 



245 



Pro val Arg Gly Thr Ala Ala Phe Tyr hIs His Leu Pro Ala lla 



260 



ser Gly Cys Lys Gin Thr Ser Thr Lys ^^g Lys Val Glu Glu mi 



275 



" ' ~' 280 
Glu val ASP ASP Phe Tyr Asp Gly He Lys Arg Leu Tyr Asn Gl'u 



290 



ASP Asn val Ser Glu Asn Val Gly Ser Val Cys Gly Thr Asp Leu 
ser Arg Gin Glu Gly His Ala Ser Pro 'cys Pro Pro Leu Gin P^o 
Val Ser Val Met 330 



<210> 28 
<211> 569 
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<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 6828539CD1 

Met°Ser\hr Ala Thr Gly Pro Glu Ala Ala Pro Lys Pro Ser Ala 
Lys Ser He Tyr Glu Gin Arg Lys Arg T^r Ser Thr Val Val Met 
Ala ASP val ser Gin Tyr Pro Val Asn His Leu Val Thr Phe cjs 
Leu Gly Glu ASP Asp Gly Val His Val Glu Asp Ala Ser A^g 

Lys Leu Ala val Met Asp Ser Gin Gly A^g val Trp Ala Gin gTu 

7 f) 

Met Leu Leu Arg Val Ser Pro Asp His Val Thr Leu Leu Asp Pro 

85 

Ala Ser Lys Glu Glu Leu Glu Ser Ty. Pro Leu Gly Ala He VaJ 
Arg Cys Asp Ala Val Met Pro Pro Gly Arg Ser Arg Ser Leu Jeu 
Leu Leu Val Cys Gin Glu Pro Glu Arg Ma Gin Pro Asp Val His 
Phe Phe Gin Gly Leu Arg Leu Gly Ala Glu Leu lie Arg Glu Asp 
lie Gin Gly Ala Leu His Asn Tyr Arg Ser Gly Arg Gly Glu Irg 
Arg Ala Ala Ala Leu Arg Ala Thr Gin Gl'u Glu Leu Gin Arg Asp 
Arg ser Pro Ala Ala Glu Thr Pro Pro III Gin Arg Arg Pro Ser 



val Arg Ala Val lie Ser Thr Val Glu Arg Gly Ala Gly Arg Gly 



Arg Pro Gin Ala Lys Pro He Pro Glu 111 Glu Glu Ala Gin l^g 



Pro Glu Pro val Gly Thr Ser Ser Asn Ala Asp Ser Ala Ser Pro 



230 



ASP Leu Gly Pro Arg Gly Pro Asp Leu III Val Leu Gin Ala g!u 
Arg Glu val Asp Xle Leu Asn His Val III Asp Asp Val Glu Ser 
Phe val ser Arg Leu Gin Lys Ssr Ala Gl" Ala Ala Arg Val III 
Glu His Arg Glu Arg Gly Arg Arg Ser '^l Arg Arg Ala Ala Gl'J 
Olu Gly Leu Leu Thr Leu Arg Ala Lys Pro Pro Ssr Glu Ala Gl^u 
Tyr Thr ASP val Leu Gin Lys He Lys ^r Ala Phe Ser Leu Leu 
Ala Arg Leu Arg Gly Asn He Ala Asp Pro Ser Ser Pro Glu III 
Leu His Phe Leu Phe Gly Pro Leu Gin Itt He Val Asn Tt. Ser 

Gly Gly Pro Glu Phe Ala Ser Ser Val ^ Arg Pro His Leu tS 

370 

ser ASP Ala Val Ala Leu Leu Arg Asp Asn Val Thr Pro Arg g!u 
Asn Glu Leu Trp Thr Ser Leu Gly Asp Ser Txp Thr Arg Pro Gl'y 
Leu Glu Leu Ser Pro Glu Glu Gly Pro Pro ^r Arg Pro Glu Phe 



PCT/US02/39133 
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Phe Ser Oly Trp Olu Pro Pro Val xhr Sp Pro Gin Ser Ar. Ill 
Trp Olu ASP Pro Val 01. g° 435 

Arg om val ..r Cln Ma Thr Cln Cln Z Ar. Gl. .rp olu ill 

Arg 01. Arg Oly Arg Ser Ala .r. Pro ^ .eu ..r Ar, .eu ill 

Phe Leu Oln Oln Ser Ala Pro oin ill Ala Val Asn Oly Z 

Arg ASP Leu Olu Pro Olu Ser Olu Pro Gin Leu Olu Ser Olu ill 

Ala Oly Lys Trp Val Leu Cys Asn Tyr Asp Phe Gin Ala Ar. Ill 

ser ser Olu Leu Ser Val Lys Oln Arg L'p Val Leu Glu Val Leu 

Glu ASP Lys Olu Lys Val Ser Glu Leu III Ala Val Met Glu L^s 

Gin Lys Lys Lys Val Olu Gly Glu Val olu Met Glu Val He 

565 

<210> 29 
<211> 429 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> inisc_f eature 

<223> Incyte ID No: 7170321CD1 

<400> 29 

Met val ASP Tyr Ser Val Trp Asp His He Glu Val Ser Asp Asp 
Glu ASP Olu Thr His Pro Asn He Asp Ala Ser Leu Phe A^g 
Trp Arg His Gin Ala Arg Val Glu Arg Met Glu Gin Phe Gin Lys^ 
Olu Lys Glu Glu Leu Asp Arg Gly Cys A^^ olu Cys Lys Arg Lys' 
val Ala Glu cys Gin Arg Lys Leu Lys Ol'u Leu Olu Val Ala o!u 
Gly Gly Lys Ala Glu Leu Glu Arg Leu oJn Ala Olu Ala Oln oln 
Leu Arg Lys Glu Glu Arg Ser Trp oiu oS Lys Leu Glu Olu MeS 
Arg Lys Lys Glu Lys Ser Met Pro Trp Val Asp Thr Leu Ser 
X-ys ASP Oly Phe Ser Lys Ala His Ala hJs Ala Pro Ala Thr S 
Met Leu Met Ala Ser Ser Pro Pro Pro 11^ Trp Met Gly Trp III 
^en Arg Pro Arg Pro Ala Arg Pro Arg Ser Arg Arg Ser Ala III 
Leu Pro Oln Pro Thr Pro Pro Leu ^r III Pro Gin Ser Met HI 
Asn Thr Lys Pro Glu Lys Thr Glu Glu L'p Ser Glu Glu Val 
Glu Gin Lys His Lys Thr Phe Val Glu 1^1 ^ oiu Lys Oln HI 
Lys His Phe Oly Met Leu Arg Arg Trp L^p Asp Ser Gin Lys '^l 
Leu ser Asp Asn Val His Leu Val Cys III oiu Thr Ala Asn I'yl 
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230 



Leu Val lie Trp Cys He Asp .eu Glu HI Oln Glu Lys Cys Ma 

Leu Met Glu Gin Val Ala His Gin Thr lie Val Met Gin Phe III 

Leu Glu Leu Ala Lys Ser Leu Lys Val III Pro Arg Ala Cys III 

p Q n 

Arg Gin Phe Phe Thr Lys He Lys Thr Ala Asp Axg Gin Tyr Met 
Glu Gly Phe Asn Asp Glu Leu Glu Ala Phe Lys Glu Arg Val 1^- 
Gly Arg Ala Lys Leu Arg He Glu Lys ll'a Met Lys Glu lyr III 
Glu Glu Glu Arg Lys Lys Arg Leu Gly III Gly Gly Leu Asp Pro 
val Glu val Tyr Glu Ser Leu Pro Glu Gl'u Leu Gin Lys Cys Itl 



ASP val Lys Asp Val Gin Met Leu Gin L'p Ala He Ser Lys Met 



3 65 



370 

ASP Pro Thr Asp Ala Lys Tyr His Met Gin Arg Cys He Asp III 



380 



Gly Leu Trp Val Pro Asn Ser Lys Ala Ser Glu Ala Lys Glu G^^ 
Glu Glu Ala Gly Pro Gly Asp Pro Leu ^eu Glu Ala Val Pro Lys 

Thr Gly Asp Glu Lys Asp Val Ser Val 
425 

<210> 30 
<211> 776 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> inisc_feature 

<223> Incyte ID No: 7505918CD1 

<400> 30 

Met ASP Thr Glu Gly Phe Gly Glu Leu Leu Gin Gin Ala Glu Gin 

Leu Ala Ala Glu Thr Glu Glv He 'ie^r pi,, t=„ r, .r- "^^ 

^-^^ Glu Leu Pro His Val Glu 

zu 25 



Arg Asn Leu Gin Glu He Gin Gin Ala Gly Glu Arg Leu Arg Ser 
Arg Thr Leu Thr Arg Thr Ser Gin Glu T^r Ala Asp Val Lys All 
ser val Leu Leu Gly Ser Arg Gly Leu Asp He Ser His He Ser 
Gin Arg Leu Glu Ser Leu Ser Ala Ala Thr Thr Phe Glu Pro ill 
Glu Pro val Lys Asp Thr Asp He Gin Gl'y Phe Leu Lys Asn Glu 
Lys ASP Asn Ala Leu Leu Ser Ala He Gi°u Glu Ser Arg Lys ^^g 
Thr Phe Gly Met Ala Glu Glu ^r His ^ Glu Ser Met Leu HI 
Glu Trp Glu Gin val Lys Gin Arg He III His Thr Leu Leu III 



ser Gly Glu Asp Ala Leu Asp Phe Thr Gl'n Glu Ser Glu Pro Ser 



160 



Tyr He ser Asp Val Gly Pro Pro Gly A^^ ser Ser Leu Asp III 



lie Glu Met Ala Tyr Ala Arg Gin He Tyr He Tyr Asn Glu lys 
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He Val Asn Gly His Leu Gin Pro Asn IJ. v»i » 

200 i"^" I-eu Cys Ala 

Ser Val Ala Glu Leu Asd Asn 210 

Asp Asp Lys Ser lie Ser Asp Met Trp Thr 

Met Val Lys Gin Met Thr Asd Val ro„ 225 

230 ^ ^ ^^'^ i-^^ Ala Thr Asp 

Ala Leu Lys Asn Arg Ser Ser Val ri„ trti > 240 

245 ^ y^?^ ^5 Glu Phe Val 

Arg Gin Ala Leu Ala Tyr Leu Glu Gin Ser Tvr Tv. . 

260 Asn Tyr Thr 

Leu Val Thr Val Phe Gly Asn Leu His rin o-, ^70 

275 Gin Leu Gly Gly 

v,i P„ aso ^ 

pro Ma P„ p„ 3^ III 300 

P» V,. Xle ^ ^ 1^ ^ 3XS 

-eu .e„ ^. 25 

oly alu P.. ... ^ |« ^ 3^ 

AT, A,. P„ «^ 360 

A« AT. M. .eu A.. A.. ^ 

'Vr ne xlo al. A,, c^. V.1 p ^ »o 

Val Ala Asp Lys Thr Glu Asp Tyr Leu Trn T«n r r 

410 ^ ^^"^ J-eu Asn Gin 

val cy. ^ ^ ^ 

I- Se. 0X„ PH. 3X„ 0X„ ^ ^33 

Ser His Phe Thr Val Asn rin pir, n 450 
455 "^^^ f^^ ^^'^ Tyr Phe Gin Val 

-u Phe Leu Thr Ala Gin Phe Glu Ala 1 val Ala Phe Leu S 
-g Met Glu Arg Leu Arg Cys His Ala ^ 1^ „i3 Val Ala Leu Z 

Leu Phe Glu Leu Lys Leu Leu Leu Lys ler S.. ri 

500 ^ f^^ Gin Ser Ala 

.in ... 3e. Hi, p„ ^ P„ ^ 

A.„ P.e Aj, ^„ »^ III 

p„ S3^ ^ ^ 340 

Asp Glu Lys Asp Ser Gin Gly Glu Asn ml^ r ^55 
560 ^eu Arg Cys Val 

Ser Glu Leu Val lie Glu Ser Aro n„ , 570 

i.er Arg Glu Phe Asp Met lie Leu Gly 

Lys Leu Glu Asn Asp Gly Ser Ara r.r. 585 

590 ^""^ III lie Asp Lys 

Phe Thr Ser Asp Thr Lys Pro ilf> a r ^0° 

605 "^^^ Val Ala Ser Val 

AU A.„ OX. ^ p., „^ ^ .X3 

..U AXa .y, A„ Asp X.. ^ X.„ |J x.. A.„ X.. S 

^ S« Pro V.X V.X P.0 0X„ XX, S„ 2 p., 3„ ^„ ^« 



660 
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Glu Arg Leu Lys Asn Met Ala Leu Ser He Ala 61u Arg Tyr Arg 
Ala Gin Oly Xle Ser Ala Asn Lys Phe vH Asp Ser Thr Phe gr 
Leu Leu Leu Asp Leu He Thr Phe Phe Isp Glu Tyr His Ser Gl'y 
His He Asp Arg Ala Phe Asp He He Gi°u Arg Leu Lys Leu Val 
Pro Leu Asn Gin Glu Ser Val Glu Glu S Val Ala Ala Phe I^g 
Asn Phe ser Asp Glu Gin Leu Arg Ser Gin Ala Arg Thr Leu He 
Thr Phe Ala Oly Met He Pro Tyr Arg lil Ser Gly Asp Thr 111 



760 



Ala Arg Leu Val Gin Met Glu Val Leu Met Asn 
770 

<210> 31 

<211> 975 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7505935CD1 

<400> 31 

Met Met Ala Gin Ser Asn Met Phe TOxr Val Ala Asp Val Leu Ser 
Gin Asp Glu Leu Arg Lys Lys Leu Tyr Gin Thr Phe Lys Asp A^g 
Gly He Leu Asp Thr Leu Lys Ttr Gin Leu Arg Asn Gin Leu He 
Hia Glu Leu Met His Pro Val Leu Ser Gl'y Glu Leu Gin Pro A^g 
ser He Ser Val Glu Gly Ser Ser Leu Leu He Gly Ala Ser Asn 
ser Leu Val Ala Asp His Leu Gin Arg Cys Gly Tyr Glu Tyr sir 
Leu ser Val Phe Phe Pro Glu Ser Gly L^ Ala Lys Glu Lys vll 
Phe Thr Met Gin Asp Leu Leu Gin Leu HI Lys He Asn Pro Jhr 
ser ser Leu Tyr Lys Ser Leu Val Ser G^y Ser Asp Lys Glu lln 
Gin Lys Gly Phe Leu Met His Phe Leu ^ys Glu Leu Ala Glu ^ 
His Gin Ala Lys Glu Ser Cys Asn Met Gl'u Thr Gin Thr Ser Ser 
Thr Phe Asn Arg Asp Ser Leu Ala Glu tyl Leu Gin Leu He 
ASP Gin Phe Ala Asp Ala Tyr Pro Gin S He Lys Phe Glu S^r 
Leu Glu He Lys Leu Asn Glu Tyr Lys Irg Glu He Glu Glu Gl'n 
Leu Arg Ala Glu Met Cys Gin Lys Leu Phe Phe Lys Asp 
Glu Xle Ala Lys He Lys Met Glu Ala S Lys Lys ^ Glu 
Glu Leu Thr Met Phe Gin Asn Asp Phe III Lys Ala Cys Gin 111 
Lys ser Glu Ala Leu Val Leu Arg Glu lJs Ser Thr Leu Glu S 



265 270 
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He His Lys His Gin Glu He Glu Thr Lys Glu He Tyr Ala Gin 
Arg Gin Leu Leu Leu Lys Asp Met Asp Leu Leu Arg Gly Arg gJu 
Ala Glu Leu Lys Gin Arg Val Glu Ala Phe Glu Leu Asn Gin Lys 
I^eu Gin Glu Glu Lys His Lys Ser He llr Glu Ala Leu Arg l^g 
Gin Glu Gin Asn lie Lys Ser Phe Glu Glu Thr ^ Asp Arg Lys 
Leu Lys Asn Glu Leu Leu Lys Tyr Gin Leu Glu Leu Lys Asp Isp 
Tyr Xle Xle Arg Thr Asn Arg Leu He Glu Asp Clu Arg Lys Isn 
Lys Glu Lys Ala Val His Leu Gin Glu gIu Leu He Ala He Isn 
ser Lys Lys Glu Glu Leu Asn Gin Ser vj. Asn Arg Val Lys Glu 
Leu Glu Leu Glu Leu Glu Ser Val Lys Ma Gin Ser Leu Ala H^e 
Thr Lys Gin Asn His Met Leu Asn Glu Lys Val Lys Glu Met Ser 
ASP Tyr ser Leu Leu Lys Glu Glu Lys Leu Glu Leu Leu Ala Gin 
Asn Lys Leu Leu Lys Gin Gin Leu Glu gJu Ser Arg Asn Glu tin 
Leu Arg Leu Leu Asn Arg Leu Ala Gin ill Ala Pro Glu Leu til 
val Phe Gin Lys Glu Leu Arg Lys Ala gIu Lys Ala Xle Val Val 
Glu His Glu Glu Phe Glu Ser Cys Arg Gl'n Ala Leu His Lys Gl'n 
Leu Gin Asp Glu Xle Glu His Ser Ala Gl^n Leu Lys Ala Gin III 
Leu Gly Tyr Lys Ala Ser Val Lys Ser Leu Thr Thr Gin Val S 
ASP Leu Lys Leu Gin Leu Lys Gin Thr Gin Thr Ala Leu Glu L'n 
Glu val ovr cys Asn Pro Lys Gin Ser vll xle Asp Arg Ser Vaf 
Asn Gly Leu Xle Asn Gly Asn Val Val Pro Cys Asn Gly Glu Til 

C p rt 

ser Gly Asp Phe Leu Asn Asn Pro Phe Lys Gin Glu Asn Val Leu 
Ala Arg Met Val Ala Ser Arg Xle Thr Ifn ^ p.o Thr Ala Jrp 
val Glu Gly ser Ser Pro Asp Ser Asp 111 Glu Phe Val Ala 111 
Thr Lys Ala Arg Val Lys Glu Leu Gin g" Glu Ala Glu Arg III 
Glu Lys Ala Phe Arg Ser Tyr His Arg 1^^ Val Xle Lys Asn HI 
Ala Lys Ser Pro Leu Ala Ala Lys Ser Pro Pro Ser Leu His Leu 
Leu Glu Ala Phe Lys Asn xle Thr Ser til Ser Pro Glu Arg S.I 
He Phe Gly Glu Asp Arg Val Val Ser Gl'u Gin Pro Gin Val GlJ 
Thr Leu Glu Glu Arg Asn Asp Val Val III Ma Leu Thr Gly Ser 
Ala Ala ser Arg Leu Arg Gly Gly Thr Ser Ser Arg Arg Leu Ser 
ser Thr Pro Leu Pro Lys Ala Lys Arg Ser Leu Glu Ser Glu III 
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740 

-y^ I.eu =lu Cly .e„ Oly ^, s.r Hi. i?= »1, se. P„ se, 
cy= Pro ASP p„ ^.^ JJ» ^ 7e5 

Ser Leu Ser lie Pro Pro Val c=>. p Z^^ ''SO 
e i-ro Pro Val Ser Ser Pro Pro Glu Gin Lys Val 

Gly Leu Tyr Arg Arg Gin Thi- pi„ t™, Zf° , "'^5 
800 ° Glu Phe 

S«: A.P val Asp Lys Leu Al. PHe Lys III a» g1„ Glu Pae Su 
ser Ph. Glu ge val Asp Gl„ Ly. 11° iie Glu Glu Gl„ S 
=iu Glu Glu Ly» lie Arg Glu Gl„ Gl„ ill Ly. Glu Arg Arg Itl 
Arg Glu Glu Arg Arg Gl„ Ser Ae„ Leu S Glu v.l Leu Glu ^ 

Glu Arg Arg Glu Leu Glu Lys Leu Tvr- m , 

875 ^ o-^"^ '^^^ Met He 

Glu Glu Ser Leu Lys He Lvs ti« t,,= t r,-. ^85 

^ '^^'^ Leu Glu Met Glu 

A» Glu Leu Glu Met Ser A,„ Glu Glu ill Lys A.p Lye Ser Z 
His ser Glu A.„ Pro Leu Glu Lye ryr 11^ Lye He zle Glu S 

Glu Gin Asp Gin Glu Ser Ala ao^s r 930 
935 ^^"^ Lys Ser Ser Lys Lys Met Val 

Gin Glu Gly Ser Leu Val Asd Thr- To,. c ^45 

950 ^-^'^ ^^'^ Asp Lys Val 

Glu Ser Leu Thr Gly Phe Ser Hi « n,. r ^60 

965 '^^'^ Asp Asp Ser Trp 



970 



<210> 32 

<211> 814 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 4225965CD1 

<400> 32 

Met ASP Pro Gin Pro Leu Arg Gly Ala Ser Glu Glu Pro Ser Gly 
Thr Gin ser Glu Oly Gly Gly Ser Ser sli Ser Gly Ala Gly s^^ 
Pro Gly Pro Pro Gly He Leu Arg Pro Leu Gin Pro Pro Gin Arg 
Ala ASP Thr Pro Arg Arg Asn Ser Ser Ser Ser Ser Ser Pro Se^ 
<^lu Trp Pro Arg Gin Lys Leu Ser Arg Lys Ala He Ser Ser .11 
Asn Leu Leu Val Arg Ser Gly Ser Thr gL° Ser Arg Gly Gly L^s' 
ASP Pro Leu ser Ser Pro Gly Gly Pro G^ Ser Arg Arg Ser As^° 
^ Asn Leu Glu Gly Xle Ser Val Lys Phe Leu Arg Gly ^ 

Pro He Thr Met Tyr Xle Pro Ser Gly Hi Arg Ser Leu Glu G^u 

Leu Pro Ser Gly Pro Pro Pro rin -rhv ^35 
140 ^^"^ Leu Ser Leu Asp Trp Val 

150 
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Tyr Gly Tyr Arg Gly Arg Asp Ser Arg Ser Asti Leu Phe Val Leu 
155 160 icq 

Arg ser Gly Glu Val Val Tyr Phe He Ala Cys Val Val Val Leu 



170 



Tyr Arg Pro Gly Gly Gly Pro Gly Gly Gly Gly Gly Gly Gin 

-"-^^ 190 195 

Arg His Tyr Arg Gly His Thr Asp Cys Val Arg Cys Leu Ala Val 

„ 205 210 

His Pro Asp Gly Val Arg Val Ala Ser Gly Gin Thr Ala Gly Val 

220 225 
Asp Lys Asp Gly Lys Pro Leu Gin Pro Val Val His He Trp Asp 

Ser Glu Thr Leu Leu Lys Leu Gin Glu lie Gly Leu Gly Ala Phe 
245 250 oKc: 

Glu Arg Gly Val Gly Ala Leu Ala Phe Ser Ala Ala Asp Gin Gly 

265 270 
Ala Phe Leu Cys Val Val Asp Asp Ser Asn Glu His Met Leu Ser 
„ 1 m '^^ 280 285 

Val Trp Asp Cys Ser Arg Gly Met Lys Leu Ala Glu He Lys Ser 

290 295 -inn 

Thr Asn Asp Ser Val Leu Ala Val Gly Phe Asn Pro Arg Asp Ser 

305 3]^Q 
Ser Cys lie Val Thr Ser Gly Lys Ser His Val His Phe Trp Asn 
320 325 

Gly Val Pro Gly Asn Gly Thr Leu Thr 

3 35 340 . 

Lys 

350 „ 355 360 

Leu Pro Asp Gly Asp He Leu Thr Gly Asp 
365 370 

Leu Thr Trp Gly Arg Ser Pro Ser Asp Ser 
380 385 3gQ 



Trp ser Gly Gly Val Gly Val Pro Gly Asn Gly Thr Leu Thr Arg 

335 340 -j/c 

Lys Gin Gly Val Phe Gly Lys Tyr Lys Lys Pro Lys Phe He Pro 
350 355 

Cys Phe val Phe Leu Pro Asp Gly Asp lie Leu Thr Gly Asp Ser 

365 370 or,c. 

Glu Gly Asn He Leu Thr Trp Gly Arg Ser Pro Ser Asp Ser Lys 

Thr Pro Gly Arg Gly Gly Ala Lys Glu Jhr Tyr Gly He Val 111 

Gin Ala His Ala His Glu Gly Ser He the Ala Leu Cys Leu l^g 

Arg Asp Gly Thr Val Leu Ser Gly Gly Gl'y Arg Asp Arg Arg Leu 

425 430 
val Gin Trp Gly Pro Gly Leu Val Ala Leu Gin Glu Ala Glu lie 

445 

Pro Glu His Phe Gly Ala Val Arg Ala He Ala Glu Gly Leu Gly 

460 

Ser Glu Leu Leu Val Gly Thr Thr Lys Asn Ala Leu Leu Arg Gly 

Asp Leu Ala Gin Gly Phe Ser Pro Val n'e Gin Gly His Thr Ifp 
485 AoZ 

Glu Leu Trp Gly Leu Cys Thr His Pro Ser Gin Asn Arg Phe Leu 

505 Rin 

Thr Cys Gly His Asp Arg Gin Leu Cys Leu Trp Asp Gly Glu Ser 
515 520 

His Ala Leu Ala Trp Ser He Asp Leu Lys Glu Thr Gly Leu Cys 

Ala Asp Phe His Pro Ser Gly Ala Val Val Ala Val Gly Leu l^n 

545 550 
Thr Gly Arg Trp Leu Val Leu Asp Thr Glu Thr Arg Glu He Val 



560 



Ser Asp val He Asp Gly Asn Glu Gin Leu Ser Val Val Arg ^r 



575 



_ 580 sflc; 

ser Pro Asp Gly Leu Tyr Leu Ala He Gly Ser His Asp Asn Val 



590 



He Tyr He Tyr Ser Val Ser Ser Asp lly Ala Lys Ser Ser l^g 
605 610 file 

Phe Gly Arg Cys Met Gly His Ser Ser Phe He Thr His Leu Asp 
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620 625 



Trp Ser Lys Asp Gly Asn Phe He Met S^^ Asn Ser Gly Asp ^ 
Glu He Leu Tyr Trp Asp Val Ala Gly gJJ Cys Lys Gin Leu Ly' 



655 



Asn Arg Tyr Glu Ser Arg Asp Arg Glu T^^ Ala Thr Tyr Thr Cys 



665 



670 



val Leu Gly Phe His Val Tyr Val Pro Val Arg Ser Cys Gin Gly 



680 



685 



Ala Glu Pro His Val Arg Gly Pro Arg oln Pro Arg Asp Gin Irg 



695 



700 



Pro lie His Ala Arg Arg Leu Ala Pro A^g Leu Ala Gly Arg sJn 



710 



715 



Gly Arg Gin His Leu Pro Val Ala Ser Al^a Gly Arg Trp Gly 2g 



725 



730 



Gly Ala Gly Ala Arg His Ala Leu Ser kin Pro Leu Pro Val Pro 

/40 745 
Arg Leu Leu Pro Arg Arg Leu He Ala Ala Trp Arg Asp Arg Leu 

760 

Ala Arg Arg Arg Gly Pro Ala Pro Pro Cys Pro Ser Leu Ala Gl" 
ser Pro Thr Thr Arg Gly Arg Leu Phe III Gly Leu Thr Ser I^g 

IT- r. 790 7QC 

Hxs Ser Arg Ser Arg He Phe Leu Glu Gly Ala Asn Gly Ala Pro 
oOO one 
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<210> 33 

<211> 3570 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> inisc_feature 

<223> Incyte ID No: 7495594CD1 



<400> 33 

Met Trp Pro Arg Leu Ala Phe Cys Cys Trp Gly Leu Ala Leu Val 

ser Gly Trp Ala Thr Phe Gin Gin Met SeJ Pro Ser Arg Asn £ 

25 -jA 
ser Phe Arg Leu Phe Pro Glu Thr Ala Pro Gly Ala Pro Gly Ser 

He Pro Ala Pro Pro Ala Pro Gly Asp gIu Ala Ala Gly Ser A^g 



val Glu Arg Leu Gly Gin Ala Phe Arg A^g Arg Val Arg Leu Leu 



65 7Q 



60 
eu 
75 



Arg Glu Leu Ser Glu Arg Leu Glu Leu Val Phe Leu Val Asp A^p 
80 85 *^ 



ser ser Ser Val Gly Glu Val Asn Phe Arg Ser Glu Leu Met Phe 



Val Arg Lys Leu Leu Ser Asp Phe Pro JaJ Val Pro Thr Ala Jte 



115 120 

Pro 

Arg val Asp Tyr lie Ser Thr Arg Arg Ma Arg Gin His Lys Jys 



Arg val Ala He Val Thr Phe Ser Ser Lys Asn Tyr Val Val Pro 
125 130 



140 

Ala Leu Leu Leu Gin Glu He Pro Ala He Ser Tyr Arg Gly Gly 

Gly Thr Tyr Thr Lys Gly Ala Phe Gin Gl'n Ala Ala Gin He til 

175 inn 
Leu His Ala Arg Glu Asn Ser Thr Lys Val Val Phe Leu He Thr 
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185 190 1 Qc 

ASP Gly Tyr Ser Asn Gly Gly Asp Pro Arg Pro He Ala Ala Ser 

205 oi n 

Leu Arg Asp Ser Gly Val Glu He Phe Thr Phe Gly He Trp 

Gly Asn He Arg Glu Leu Asn Asp Met III Ser Thr Pro Lys Glu 

235 

Glu His Cys Tyr Leu Leu His Ser Phe Glu Glu Phe Glu Ala Leu 
•£45 250 Tcc 

Ala Arg Arg Ala Leu His Glu Asp Leu Pro Ser Gly Ser Phe lie 

265 oTfi 
Gin ASP ASP Met Val His Cys Ser Tyr Leu Cys Asp Glu Gly Lys 

ASP cys Cys Asp Arg Met Gly Ser Cys L^s Cys Gly Thr His Sr 

295 -jnn 

Gly His Phe Glu cys He Cys Glu Lys Gly Tyr Tyr Gly Lys Gly 
Leu Gin Tyr Glu Cys Thr Ala Cys Pro Ser Gly Thr Tyr Lys III 
Glu Gly ser Pro Gly Gly He Ser Ser Cys He Pro Cys Pro Isp 
Glu Asn His Thr Ser Pro Pro Gly Ser tS Ser Pro Glu Asp lys 

val cys Arg Glu Gly Tyr Arg Ala Ser GlJ Gin Thr Cys Glu Leu 

370 -jTc 

val His Cys Pro Ala Leu Lys Pro Pro Glu Asn Gly Tyr Phe lie 

385 

Gin Asn Thr Cys Asn Asn His Phe Asn Ala Ala Cys Gly Val S 

cys His Pro Gly Phe Asp Leu Val Gly Ser Ser He He Leu Cyl 

Leu Pro Asn Gly Leu Trp Ser Gly Ser Glu Ser Tyr Cys Arg Vel 

Arg Thr Cys Pro His Leu Arg Gin Pro Lys His Gly His He Ser 

Cys ser Thr Arg Glu Met Leu Tyr Lys Thr Cys Leu Val til 

460 /ICC 
cys Asp Glu Gly Tyr Arg Leu Glu Gly Ser Asp Lys Leu Thr ^ys 

Gin Gly Asn Ser Gin Trp Asp Gly Pro Glu Pro Arg Cys Val Glu 

^oD 490 
Arg His Cys Ser Thr Phe Gin Met Pro Lys Asp Val He He Ser 

^uu 505 
Pro His Asn cys Gly Lys Gin Pro Ala Lys Phe Gly Thr He Cys 

DlO 520 CO 

Tyr val Ser Cys Arg Gin Gly Phe He Leu Ser Gly Val Lys Glu 

Met Leu Arg Cys Thr Thr Ser Gly Lys T^ Asn Val Gly Val Itl 

550 

• Ala Ala val Cys Lys Asp Val Glu Ala Pro Gin He Asn Cys Pro 

565 

Lys ASP He Glu Ala Lys Thr Leu Glu Gin Gin Asp Ser Ala Isn 

580 

val Thr Txp Gin He Pro Thr Ala Lys Asp Asn Ser Gly Glu Lys 

595 c 

val Ser Val His Val His Pro Ala Phe Thr Pro Pro Tyr Leu Phe 

610 c 

Pro He Gly Asp Val Ala He Val Tyr Thr Ala Thr Asp Leu ler 

625 

Gly Asn Gin Ala Ser Cys He Phe His He Lys Val He Asp Ma 

Glu Pro Pro Val He Asp Trp Cys Arg Ser Pro Pro Pro Val cJn 

655 660 
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Val Ser Gin Lys Val His Ala Ala Ser Trp Asp Glu Pro Gin Phe 



665 



Ser Asp Asn Ser Gly Ala Glu Leu Val ile Thr Arg Ser His S 

680 685 /CQ 

Gin Gly Asp Leu Phe Pro Gin Gly Glu Thr lie Val Gin Tyr Tte 

,1 ^r- 700 70S 

Ala Thr Asp Pro Ser Gly Asn Asn Arg Thr Cys Asp He His lie 

710 715 7pn 

val He Lys Gly Ser Pro Cys Glu He Pro Phe Thr Pro Val Asn 

725 730 -j-in: 

Gly Asp Phe He Cys Thr Pro Asp Asn Thr Gly Val Asn Cys Thr 

745 7cn 
Leu Thr Cys Leu Glu Gly Tyr Asp Phe Thr Glu Gly Ser Thr Asp 

755 760 -jf-Z 

Lys Tyr Tyr Cys Ala Tyr Glu Asp Gly Val Trp Lys Pro Thr Tyr 

770 775 7pr, 

Thr Thr Glu Trp Pro Asp Cys Ala Lys Lys Arg Phe Ala Asn His 

r,l r,^ 790 79c 

Gly Phe Lys Ser Phe Glu Met Phe Tyr Lys Ala Ala Arg Cys Asp 

805 s-in 
Asp Thr Asp Leu Met Lys Lys Phe Ser Glu Ala Phe Glu Thr Thr 
^■"•^ 820 a?S 

Leu Gly Lys Met Val Pro Ser Phe Cys Ser Asp Ala Glu Asp He 

835 840 
Asp Cys Arg Leu Glu Glu Asn Leu Thr Lys Lys Tyr Cys Leu Glu 
™. 850 ariiz 

Tyr Asn Tyr Asp Tyr Glu Asn Gly Phe Ala He Gly Pro Gly Gly 
Trp Gly Ala Ala Asn Arg Leu Asp Tyr Ser Tyr Asp Asp Phe lIu 
Asp Thr val Gin Glu Thr Ala Thr Ser He Gly Asn Ala Lys Tel 
ser Arg He Lys Arg Ser Ala Pro Leu Ser Asp Tyr Lys He Lys 
Leu He Phe Asn He Thr Ala Ser Val Pro Leu Pro Asp Glu Arg 

Asn ASP Thr Leu Glu Trp Glu Asn Gin Sn Arg Leu Leu Gin ?te 
935 940 q-t- 

Leu Glu Thr He Thr Asn Lys Leu Lys Arg Thr Leu Asn Lys Asp 
550 955 

Pro Met Tyr Ser Phe Gin Leu Ala Ser Glu He Leu He Ala Asp 
965 970 

Ser Asn Ser Leu Glu Thr Lys Lys Ala Ser Pro Phe Cys Arg Pro 

„ 980 985 aar, 

Gly Ser Val Leu Arg Gly Arg Met Cys Val Asn Cys Pro Leu Gly 
995 1000 1005 



Thr Tyr Tyr Asn Leu Glu His Phe Thr Cys Glu Ser Cys Arg lie 

r.-, ^ ■'-^■'■^ 1015 io?n 

Gly Ser Tyr Gin Asp Glu Glu Gly Gin Leu Glu Cys Lys Leu Cys 

1025 1030 in^E; 

Pro ser Gly Met Tyr Thr Glu Tyr He His Ser Arg Asn He Ser 

1045 1050 

Asp Cys Lys Ala Gin Cys Lys Gin Gly Thr Tyr Ser Tyr Ser Gly 

■'■^^^ 1060 infiq 

Leu Glu Thr Cys_^Glu Ser Cys Pro Leu^Gly Thr oyr Gin Pro Lys 

Phe Gly Ser Arg Ser Cys Leu Ser Cys"ro Glu Asn Thr Ser^Jhr 
1085 1090 inqc; 

val Lys Arg Gly Ala Val Asn He Ser Ala Cys Gly Val Pro 2Js 
-'•100 1105 iiin 

Pro Glu Gly Lys Phe Ser Arg Ser Gly Leu Met Pro Cys His Pro 
n, r, , ^•'•-'■^ 1120 1125 

cys Pro Arg Asp Tyr Tyr Gin Pro Asn Ala Gly Lys Ala Phe Cys 
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1130 



1135 



Lau Ala Cys Pro Phe Tyr Gly Thr Thr^Pro Phe Ala Gly Ser^Arg 
Ser Val Thr Glu Cys Ser Ser Phe Ser Ser Thr Phe Ser Ala^Alf 

Glu Glu ser Val Val Pro Pro Ala sJlll Gly His He ^y.'tH 
ll/s 1180 1 

Arg His Glu He Ser Ser Gin Val Phe His Glu Cys Phe Phe'L'n 
xj.yu 119S -1 <-! 

Pro cys His Asn^Ser Gly Thr Cys Gin Gin Leu Gly Arg Gly'^^ 

val cys Leu Cys^Pro Leu Gly Tyr Thr'sJy Leu Lys Cys Glu'S 
ASP He ASP Glu cys Ser Pro Leu Pro'gs Leu Asn Asn Gly'JaJ 



Cys Lys Asp Leu Val Gly Glu Php Tlo o^o r^i ^ ^^'^^ 
1250 Ser Gly 



- 1240 1245 

r Gly 

Tyr Thr Gly Gin Arg Cys Glu Glu Asn^ll Asn Glu Cys Server 
ser Pro Cys Leu_^Asn Lys Gly He Cys Val Asp Gly Val Ala cly 
Arg Cys Thr^Cys Val Lys Gly Phe'Jfl Gly Leu His Cys^Glu 
Thr Glu val Asn^Glu Cys Gin Ser Asn^Jo Cys Leu Asn Asn Ma 
val cys Glu Asp Gin Val Gly Gly Phe"eu Cys Lys Cys Pro"?° 



Gly Phe Leu Gly^Thr Arg Cys Gly Lys'lsn Val Asp Glu Cys"eu 
ser Gin Pro Cys^Lys Asn Gly Ala rt^'l^ Lys Asp Gly Ala"sn 
ser Phe Arg Cys Leu Cys Ala Ala Gly'pSe Thr Gly Ser His'cys 



Glu Leu Asn He^ Asn Glu Cys Gin Ser'lsn Pro Cys Arg Asn'ofn 



1385 1390 



1435 



Leu ASP Gly Met Leu Pro Ser Leu His Ma Leu Thr Cys Thr'phe 



Ala Thr cys Val^Asp Glu Leu Asn Ser Tyr Ser Cys Lys Cys'oi'n 

Pro Gly Phe Ser Gly Lys Arg Cys Glu'^Sr Glu Gin Ser Thr'c" 

x4X3 1420 
Phe Asn Leu Asp^Phe Glu Val Ser Gly lie Tyr Gly Tyr Val^Me^ 

i; Leu 
1445 
r Ser 
X460 
> Asn 

1475 TAon — 

TVr Asn Gly Trp Val Leu Tyr Val Asn Gl'y Arg Glu Lys Xle'^^ 
i4yo 1495 



Trp Met Lys Ser Ser Asp Asp Met Asn'J^ Gly Thr Pro He'ser 



Tyr Ala Val Asp^jsn Gly Ser Asp Asn Thr Leu Leu Leu Thr Asp 



1460 i^fic 

c Asp 
1485 
5 Thr 

Asn cys Pro Ser Val Asn Asp Gly Arg^^^ His His He Alall'e 



1505 Tcin 
Thr Trp Thr Ser^Ala Asn Gly Xle Trp^Lys Val Tyr He Asp'cJy 

Lys Leu ser Asp^Gly Gly Ala Gly Leu'ser Val Gly Leu Pro^le 

Pro Gly Gly Gly Ma Leu Val Leu Glyll'n Glu Gin Asp Lys^Lj^ 

Gly Glu Gly Phe^Ser Pro Ala Glu Ser^Phe Val Gly Ser Xle'K" 

Gin Leu Asn Leu^Trp Asp Tyr Val Leu'sir Pro Gin Gin Val'S 

ser Leu Ala Thr^Ser Cys Pro Glu GIu'l^u Ser Lys Gly Asn'Jf? 

1600 igo5 
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Leu Ala Trp Pro^Asp Phe Leu Ser Gly He Val Gly Lys Val Lys 

lie Asp ser Lys Ser He Phe Cys Ser'lsp Cys Pro Arg Leu'Sy 
^^^^ 1630 ico^ 

Gly ser Val Pro His Leu Arg Thr Ala Ser Glu Asp Leu Lys Pro 
r^■^ o ■^^^^ 1645 ifisn 

Gly Ser Lys Val^Asn Leu Phe Cys Asp Pro Gly Phe Gin Leu Val 

Gly Asn Pro Val^Gln Tyr Cys Leu Asn Jin Gly Gin Trp Thr'cS 

Pro Leu Pro His Cys Glu Arg He Ser'cys Gly Val Pro Pro'pro 
" 1690 16QC 

Leu Glu Asn Gly Phe His Ser Ala Asp Asp Phe Tyr Ala Gly Ser 

l/OO 1705 ni n 

Thr val Thr Tyr Gin Cys Asn Asn Gly Tyr Tyr Leu Leu Gly Isp 

I'lS 1720 
Ser Arg Met Phe_^Cys Thr Asp Asn Gly Ser Trp Asn Gly Val Ser 

Pro Ser Cys Leu Asp Val Asp Glu Cys^Jf Val Gly Ser Asp'cjs 

1750 

ser Glu His Ala^Ser Cys Leu Asn Val^Asp Gly Ser -Vyr Ile'cjs 
ser cys Val Pro_^Pro Tyr Thr Gly Asp^Gly Lys Asn Cys Ala'clu 
Pro He Lys Cys Lys Ala Pro Gly Asn^ro Glu Asn Gly His"er 

J- /y U 1795 1 ann 

Ser Gly Glu lle^Tyr Thr Val Gly Ala_^Ala Val Thr Phe Ser "Js 

Gin Glu Gly Tyr Gin Leu Met Gly Val'Si Lys He Thr Cys'Su 
l°'2" 1825 iR-in 

Glu ser Gly Glu Trp Asn His Leu He Pro Tyr Cys Lys Ala Val 
^^^^ 1840 1Q/1C 

ser cys Gly Lys Pro Ala He Pro Glu Asn Gly Cys He Glu gJu 

1855 1 Qcn 

Leu Ala Phe Thr Jhe Gly Ser Lys Val Jhr ^ Arg Cys Asn Lys 

Gly Tyr Thr Leu Ala Gly Asp Lys Glu'sir Ser Cys Leu Ala^lsn 
■••^^^ 1885 iQon 

ser Ser Trp Ser His Ser Pro Pro Val Cys Glu Pro Val Lys Cys 
1895 1900 ^/q^ 

Asn He Asn Asn Gly Lys Tyr He Leu Ser Gly 
^^^^ 1915 1920 
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ser ser Pro Glu Asn He Asn Asn Gly Lys Tyr He Leu Ser Gly 
_ 1310 1915 iqon 

Leu Thr lyr Leu Ser Thr Ala Ser Tyr Ser Cys Asp Thr Gly 

1930 iD-jc 
ser Leu Gin Gly Pro Ser He He Glu Cys Thr Ala Ser Gly lie 

" 1945 locn 

Trp ASP Arg Ala Pro Pro Ala Cys His Leu Val Phe Cys Gly Glu 
1955 I960 iQcc: 

Pro Pro Ala He^Lys Asp Ala Val He Thr Gly Asn Asn Phe T^ 



Phe Arg Asn Thr Val Thr Tyr Thr Cys^Lys Glu Gly Tyr Thr^Leu 

Ai ^^^^ 1990 iqqc 

Ala Gly Leu Asp^Thr He Glu Cys Leu^Ala Asp Gly Lys Trp Ser 

Arg ser Asp Gin Gin Cys Leu Ala Val'ser Cys Asp Glu Pro'p" 

2020 oAoc 

He val ASP His Ala Ser Pro Glu Thr Ala His Arg Leu Phe GlJ 

2035 onyin 

Asp He Ala Phe Tyr Tyr Cys Ser Asp Gly Tyr Ser Leu Ala Isp 

2050 onc;^ 

Asn Ser Gin Leu Leu Cys Asn Ala Gin Gly Lys Trp Val Pro Pro 
^Ubu 2065 

Glu Gly Gin Asp Met Pro Arg Cys He Ala His Phe Cys Glu^Lys 



53/89 



wo 03/050253 

PCT/US02/39133 



Lys Phe Ala Ala Gly Ser Val Val Ser Phe Lys Cys Met Glu Gly 

Phe val Leu Asn Thr Ser Ala Lys He Glu Cys Met Arg Gly Gly 
r.1 m 2125 2130 

Gin Trp Asn Pro Ser Pro Met Ser He Gin Cys He Pro Val Arg 

r, r,n , ^"'•^^ 2140 214S 

Cys Gly Glu Pro Pro Ser He Met Asn Gly Tyr Ala Ser Gly Ser 

2155 2160 
Asn Tyr Ser Phe Gly Ala Met Val Ala Tyr Ser Cys Asn Lys Gly 
2165 2170 917^ 

Phe Tyr He Lys Gly Glu Lys Lys Ser Thr Cys Glu Ala Thr Gly 
ni ^ r, 2185 2190 

Gin Trp Ser Ser Pro He Pro Thr Cys His Pro Val Ser Cys G^y 
^-'•^^ 2200 2705 

Glu Pro Pro Lys Val Glu Asn Gly Phe Leu Glu His Thr Thr Gly 
2210 2215 ooori 

Arg He Phe Glu Ser Glu Val Arg Tyr Gin Cys Asn Pro Gly JJr 
^^^^ 2230 22^S 

Lys ser Val Gly Ser Pro Val Phe Val Cys Gin Ala Asn Arg His 
^■^^^ 2245 oo<^n 

Trp HIS ser Glu Ser Pro Leu Met Cys Val Pro Leu Asp Cys Gly 
^^^^ 2260 226S 

Lys Pro Pro Pro He Gin Asn Gly Phe Met Lys Gly Glu Asn Phe 
r.T „ 1 2275 2280 

Glu val Gly Ser Lys Val Gin Phe Phe Cys Asn Glu Gly Tyr Glu 
2285 2290 99qc; 

Leu Val Gly Asp Ser Ser Trp Thr Cys Gin Lys Ser Gly Lys Trp 

2305 23]0 
Asn Lys Lys Ser Asn Pro Lys Cys Met Pro Ala Lys Cys Pro Glu 
^■^^ 2320 2325 

Pro Pro Leu Leu Glu Asn Gin Leu Val Leu Lys Glu Leu Thr Thr 
" 2335 ■j-kArt 

Glu val Gly Val Val Thr Phe Ser Cys Lys Glu Gly His Val Leu 
ni ^^^^ 2350 235'; 

Gin Gly Pro Ser Val Leu Lys Cys Leu Pro Ser Gin Gin Trp Asn 

2365 2370 
Asp Ser Phe Pro Jal Cys Lys He Val Leu Cys Thr Pro Thr Pro 

Leu He Ser Phe^Gly Val Pro He Pro'ser Ser Ala Leu His'phe 

Gly ser Thr Val Lys Tyr Ser Cys Val^GlJ Gly Phe Phe Leu'l^g 
m r, , ^^^^ 2410 2415 

Gly Asn Ser Thr Thr Leu Cys Gin Pro Asp Gly Thr Trp Ser Ser 

2425 OAio 
Pro Leu Pro Glu Cys Val Pro Val Glu Cys Pro His Leu Arg lin 

2435 2440 oa/ic: 

Pro His Gly He He Asp Val Gin Gly Leu Ala Tyr Leu Ser tJ^ 
AT . 2455 o4fin 

Ala Leu Tyr Thr Cys Lys Pro Gly Phe Glu Leu Val Gly Asn Thr 

^io5 2470 
Thr Thr Leu Cys^Gly Glu Asn Gly Hisjrp Leu Gly Gly Lys Pro 

Thr Cys Lys Ala He Glu Cys Leu Lys'pro Lys Glu He Leu'lfn 
m T ^ 2500 2505 

Gly Lys Phe Ser Tyr Thr Asp Leu His Tyr Gly Gin Thr Val Thr 

'ibxQ 2515 "jcion 

Tyr Ser Cys Asn Arg Gly Phe Arg Leu Glu Gly Pro Ser Ala L^u 
mv 2530 oc-jc 

Thr Cys Leu Glu Thr Gly Asp Trp Asp Val Asp Ala Pro Ser Cys 

2540 2545 2550 
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Asn Ala He His Cys Asp Ser Pro Gin Pro He Glu Asn Gly Phe 
„ ■■ ^^^^ 2560 

Val Glu Gly Ala^Asp Tyr Ser Tyr Gly Ala He He lis Tyr Ser 

cys Phe Pro Gly Phe Gin Val Ala Gly'sls Ala Met Gin Thr'cys 
2585 2590 9^QR 

Glu Glu Ser Gly Trp Ser Ser Ser He Pro Thr Cys Met Pro lie 

2605 261 n 

ASP cys Gly Leu^Pro Pro His He Asp^Phe Gly Asp Cys Thr Ss 

Leu Lys Asp ASP Gin Gly Tyr Phe Glu'c^n Glu Asp Asp Met'S? 

2635 oan 
Glu val Pro Tyr Val Thr Pro His Pro Pro Tyr His Leu Gly Ma 
„ 1 ^^^^ 2650 iRH'i 

val Ala Lys Thr Trp Glu Asn Thr Lys Glu Ser Pro Ala Thr His 

2665 9fi7n 
ser ser Asn Phe Leu Tyr Gly Thr Met Val Ser Tyr Thr Cys Asn 

2680 ofioir 
Pro Gly Tyr Glu Leu Leu Gly Asn Pro Val Leu He Cys Gin Glu 

2695 2700 
Asp Gly Thr Trp Asn Gly Ser Ala Pro Ser Cys He Ser He Glu 

2710 271 =; 

cys Asp Leu Pro Thr Ala Pro Glu Asn Gly Phe Leu Arg Phe Thr 
2720 2725 2730 

_ Ser Cys Lys Pro 

2735 2740 2745 

Gly Ser Asp Leu Arg Leu Cys Leu Glu Asn 

2755 2760 
Ala Ser Pro Arg Cys Glu Ala He Ser Cys 
2765 2770 2775 

Lys Gly Ser Asn 

2780 2785 2790 

Leu Tyr Tyr Glu Cys Asp Pro Gly Tyr 
2795 2800 2805 
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Glu Thr ser Met^Gly Ser Ala Val Gln^Tyr Ser Cys Lys Pro Gly 
His He Leu Ala^Gly Ser Asp Leu Arg^Leu Cys Leu Glu Asn'I^g 

Lys Trp ser Gly Ala Ser Pro Arg Cys'cK Ala He Ser Cys^Lys 
2765 2770 277S 

Lys Pro Asn Pro Val Met Asn Gly Ser He Lys Gly Ser Asn Tyr 
m. 2785 27Qn 

Thr Tyr Leu Ser Thr Leu oyr Tyr Glu Cys Asp Pro Gly Tyr Val 
^795 2800 -pone; 

Leu Asn Gly Thr Glu Arg Arg Thr Cys Gin Asp Asp Lys Asn 

2815 28?0 

ASP Glu ASP Glu Pro He Cys He Pro Val Asp Cys Ser Ser ^ro 
2825 2830 oa-ic 

Pro val Ser Ala^Asn Gly Gin Val Arg Gly Asp Glu Tyr Thr Phe 

Gin Lys Glu He^Glu Tyr Thr Cys Asn^Glu Gly Phe Leu Leu'slu 

Gly Ala Arg Ser Arg Val Cys Leu Ala'lfn Gly Ser Trp Ser'cS 
ST mu r, ^ " 2875 2880 

Ala Thr, Pro Asp Cys Val Pro Val Arg Cys Ala Thr Pro Pro Gin 

2890 OQQc 
Leu Ala Asn Glyjal Thr Glu Gly Leu^Asp Tyr Gly Phe Met Lys 

Glu val Thr Phe His Cys His Glu Gly^?^ He Leu His Gly^Ma 

2920 OQOi^ 
Pro Lys Leu Thr^Cys Gin Ser Asp Gly^Asn Trp Asp Ala Glu He 

Pro Leu Cys Lys^Pro Val Asn Cys Gly^Ko Pro Glu Asp Leu^Ma 

His Gly Phe Pro Asn Gly Phe Ser Phe^Jle His Gly Gly Hislle 
™ 2965 2970 

Gin Tyr Gin Cys Phe Pro Gly ivr Lys Leu His Gly Asn Ser Ser 
TV , « 2980 2QDC 

Arg Arg Cys Leu Ser Asn Gly Ser Trp Ser Gly Ser Ser Pro Ser 

2995 tnnn 
cys Leu Pro Cys Arg Cys Ser Thr Pro Val He Glu Tyr Gly Thr 
•, •^"'^^ 3010 Tfiic 

val Asn Gly Thr Asp Phe Asp Cys Gly Lys Ala Ala Arg He Gin 
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Cys Phe Lys Gly Phe Lys Leu Leu Gly Leu Ser Glu He Thr Cys 

3040 "^o 
Glu Ala Asp Gly^Gln Trp Ser Ser Gly Phe Pro His Cys Glu H^s 

Thr ser Cys Gly Ser Leu Pro Met Ile'^ro Asn Ala Phe Ile'sSr 

3070 T.r\nc: 
Glu Thr ser Ser Trp Lys Glu Asn Val He Thr Tyr Ser Cys Arg 

3085 OAQn 

ser Gly Tyr Valine Gin Gly Ser Ser Asp Leu He Cys Thr Glu 
i Ser 

3110 - '3115 



Lys Gly val Trp Ser Gin Pro Tyr Pro Val Cys Glu Pro Leu Ser 



Cys Gly ser Pro Pro Ser Val Ala Asnl^a Val Ala Thr Gly III 

3130 3135 
Glu Ser Glu Val Lys Leu Arg Cys Leu Glu 
3140 3145 3^2Q 

Thr Cys Gin Lys 

_ w -QOT" Dvr% Ttm r- — 

3170 3175 



Ala Pro Thr Tyr Glu Ser Glu Val Lys Leu Arg Cys Leu Glu Siy 
3^^" 3145 
c Phe 
3160 



Tyr Thr Met Asp Thr Asp Thr Asp Thr Phe Thr Cys Gin Lys Asp 

3160 -5 1 CR 

Gly Arg Trp Phe Pro Glu Arg He Ser Cys Ser Pro Lys Lys Jys 
^■^'^ 3175 Tinn 

Pro Leu Pro Glu Asn He Thr His He Leu Val His Gly Asp A^p 

3190 n Qc 

Phe Ser Val Asn^Arg Gin Val Ser Val^Ser Cys Ala Glu Gly J^r 

Thr Phe Glu Gly val Asn He Ser Val'cjs Gin Leu Asp Gly'Sr 
^^l^ 3220 -kooa 

Trp Glu Pro Pro Phe Ser Asp Glu Ser Cys Ser Pro Val Ser gs 

3235 ->0An 
Gly Lys Pro Glu Ser Pro Glu His Gly Phe Val Val Gly Ser lJs 

Tyr Thr Phe Glu Ser Thr He He Tyr Gin Cys Glu Pro Gly'gr 

jZo\) 3265 
Glu Leu Glu Gly Asn Arg Glu Arg Val Cys Gin Glu Asn Arg'Sn 

Trp ser Gly Gly Val Ala He Cys Lys^Gl'u Thr Arg Cys Glu'S 

3295 nnn 
Pro Leu Glu Phe^Leu Asn Gly Lys Ala^Asp He Glu Asn Arg Tto 

Thr Gly Pro Asn Val Val T^r Ser Cys'lsn Arg Gly 'Tyr sJlll 

3325 ■5T3n 
Glu Gly Pro Ser Glu Ala His Cys THr Glu Asn Gly Thr Trp Ser 
" 3340 T3/tc 

Hxs Pro val Pro Leu Cys Lys Pro Asn Pro Cys Pro Val Pro Phe 

3355 lien 
val He Pro Glu^Asn Ala Leu Leu Ser^Glu Lys Glu Phe Tyr vf 1 

ASP Gin Asn Val^Ser He Lys Cys Arg'ciu Gly Phe Leu Leu'S 

Gly His Gly He He Thr Cys Asn Pro"s| Glu Thr Trp Thr^Gln 

- 3400 3405 

i His 

val Glu Asn Ala^He Ala Arg Gly Val^^His TVr Gin Tyr Gly'^sp 

Met He Thr T^^Ser Cys T^r Ser Gly^T^r Met Leu Glu Gly'Jhe 

Leu Arg Ser Val Cys Leu Glu Asn GlyiS: Trp Thr Ser Pro'pro 

He Cys Arg Alajal Cys Arg Phe Pro Cys Gin Asn Gly Gly ^" 

Cys Gin Arg Pro Asn Ala Cys Ser Cys'J^o Glu Gly Trp Metily 
3485 3490 349^ 
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Arg Leu Cys Glu Glu Pro He Cys He Leu Pro Cys Leu Asn Gly 

3505 Tcinn 

Gly Arg Cys Val^Ala Pro Tyr Gin Cys Asp Cys Pro Pro Gly 

Thr Gly ser Arg Cys His Thr Ala Val'cys Gin Ser Pro Cya^Leu 

3535 Tc^A 
Asn Gly Gly Lys Cys Val Arg Pro Asn Arg Cys His Cys Leu Ser 

^-■45 3550 T^Rc; 

ser Trp Thr Gly His Asn Cys Ser Arg Lys Arg Arg Thr Gly Phe 

3560 3565 35^0 



<210> 34 
<2H> 3S96 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7722608CB1 

<400> 34 



caScgcgc c?cccaagca Taalt'^^^ -^gcgggcc ccgccgcgcc gccccccaag 60 
cgcgccctS actgcatgaa cttac^r^o= ^^^^^^^^^^ agctggaggc gcggttcaac 120 
aacgagaaga agtgggagct citctataa^ f ^^^^^^^f agctgctgag ccagtatgac 180 
acaaccta^;, t^n^ZT Z ^^^^^^^^S^gat caggagcggt ttcaagtcaa gaatcccccc 240 

si ii Si =1 =1 E= 

SSSSS L-SSS °~ 

cSss VciirP """'"^ '^z^i iti 
s i ~ ~' ^ 

ssss did? i F-s '°» 

aacttccg?g tlttcc^gca ItaSaattc a™ tacattcggt ggagaacatg 1020 

aggcttcggc tcaccgaaaa ta^n^tn^^^ acccacttgg gcctggacct gtacttggag 1080 
a??tttgl?g tggSSc? gctgaSaac ^^!f ^""^^^ tccaggcgta cctggacaat 1140 
atggaggaal tgcagSca l^laatn^t^ acagagacca agaacgctgt gctggagcac 1200 
gaaSccItgg ccaagaSg? agafctaa^a lltt^'^^f^^ ggcttcggga cgcggagaac 1260 
gagaccctgc gggagcgc?t caacaaa?™ ^^^"^^^^^aa gccaggcgcg caaggagttg 1320 
gagcctga?a aSgcgcctcc cgltcccS gcgcctccag gcgtccccca 1380 

gaggagctgg agga^aagS StaltcS? at^cf™^ cggccctgga gctgaaggtg 1440 
atcgagatcc tccccatcac attctgcggg ggccggggga tgctgtctcc 1500 

gtgccgaccg gctcccccag cccaS'tc gScctacaa gactccgggg 1560 

gcgccaccgc cgocgccccc actaccr^^^ n^^^^?^ cagagccggc tcccggagca 1520 
gcgcccccac aSgccccgcc Iclaaotaln cgcaggaagc cccgccctct 1680 

occggagacc tgccgccccc Scccgcca Sacca™ gccgccgctg 1740 

gtgcctccgc cgccgccgcc gccgccacca r^f^^^ ctccgggcac tgacgggccg 1800 

SSS S i ' S~ S«S SSS LIS 

sss i iiSi ~" -s- 

cagaaggccc ccagcaaggc gacac^cat? ?aaaJn^?^^ ctctcaagag taaggcagcc 2100 

"sss li rF°F Sc'ss.- is 

ctc„e.?ec -SSS SSS! LIS 
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PCT/US02/39133 

atcattgcag cctcaatgtc catcaagtcc tctgacaaac tccgccagat cctaaaaatt ?S?n 
gtcctggcct ttggcaacta catgaacagt agcaagcgtg gggcagccS tggc^Sg 258? 
T.T.nT'' gttggagatg aagtcgactg I?cgcaagca Sgctgctg 264? 

ctgcacttc? tTallllaar tacccgcaac tcaLggctt ccacagcgac 2700 

^^^^r"'''' tggacaaggc gggctcagtg tccctggaca gtgtcctggc ggacgtgccc 2760 

SS?:s £c~ — asss 
acss sss^ ssss? ~i -~ Sii 
~- s— ?~ - bs s 

cataataaaa "^^^ff^^^^ acggaggcag cagaaggagc cStca??ta ?gagagcgac 3^2 
cgtgatgggg ccattgaaga catcatcaca gatctgcgga accagcccta catccacaca ^yln 
ITrr';^?^"" ^ccgcagtgc ccgtcggcgt cccccgggcc ccccfctgca ggtcacc?cc 33^? 
gacctctcgc tgtagccgct atttctgcag gtggattctg caggggtgtg gqaccataaa llln 
caggctgagg ctcaaggaag gtggtcctca gctcggctgg ccgggcagcc cc^cc^ccac 34^2 

cgcccccccc cccaaatgct gcttgcagca cccaccctaa agccccctcc aaafarrr^nat- ika^ 
acttagcctc agcaggagcc tggcctgtaa cttataaagt gcacctcgcc cccSaaacc ^6^0 

Tallllllll laaaaf '^'^ tggaccttat ttttatatga gattaSaa g^tgtttS 366? 
aaaaaaaaaa aaaaagcggc gctcgcgatc tagaac 



3696 



<210> 35 

<211> 5585 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7505869CB1 

<400> 35 



gaSt??tc agcScatcc I^cf ^^^ggtcgtg ccgtgcgggg acggccacat 60 
yaaagctccc agcctcatcc agcaggcggt gacccgctac cggaaggcca tcoccaaaoa Ton 

sscS sss^ sssss s~ g is 
ps? ^sss ~ 

i Hi SSii ~ -ssa ~ 

ao«^?»^f^? ccttcaagga aaaatcccac acgctggtca acaacagctg gcttcctcaa 540 

= ?K ss?- 1~ ~ 
= ss«::-a- ~ 

gcatataccc aacttttctc tggatgatat ggtaaagctc gtagaagtcc ccScaS RdS 

^= ^sss -g-s s™ = 

SSS^ S"S=g SSS — S SKS ? S 

aaataaagag cagtatgaac aactatccca aagtgagaag aacaat^ac? attcaaacca llTo 
^gtgSS SZTclTa SaacScc^ ^^T^^^^^ aacagtgcag T.attoToll llto 

aIctca?aSc gcJcacccct cgggaaaacc TcaJn^'^^f atagactctc actcaagact 1260 
^ a>-ai,cn-t.ccc cgggaaaacc accatccgct ccagcctcgg cacctcaaaa n9n 
tgtatttagt acgactgtaa gcagtggtta taacaccaaa aaaataggca agacact^aa ^lln 
lllTaafaal f^^^?^^*^^ cagaaggttt gggattcagc atcact^cca glgatg^aac ^440 
gStggcSa ctta™^ tctatgtgaa aaacattctc ccccgggggg cggccaSa ^sJS 
ggatggccga cttaaggcag gagacagact tatagaggta aatggagtag atttaatoatT i ^fin 
aattTT tttcgctgtt gagaagclcc aagafgSag Jaac^tga? ^62? 

ccStSg lilaTaZl' ITcallT' ccacccaagg gaLtgat^ cagagJcS? Itlo 
ctyduycag accccaaaag aaacgaaagc agaagatgag gatattgttc ttacacr^^ra i74n 
tggcaccagg gaatttctga catttgaagt cccacttaa? gattca^gat ctgcaSacct lloQ 
tggtgtcagt gtcaaaggta accggtcaaa agagaaccac JcagatSgg gaatc?tS ^860° 
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cctaagaagg tctatgtcta ctaaaS;; aaccaagatg ccatggaaac 1980 

aaggagaata agcaagtSa atgagc?Saa aTrl'^ll^^^ atgatccagc ttattgttgc 2040 
gcccattgaa acagcgtE^g atS^aaaaa f.cT ^11 agcccccctg gacctgagct 2100 
gattgagggg cttga^g^S cgcccagS SSctocc ^r'^"'^"'^ tctacagtgg 2160 
gtcaggtaaa taccagctgt ccccScaa? llllltl^ ctcagtagga taatgggtga 2220 
agaagatgac aggttgccag tgcttcctcc !catS^^n^ ^^^^^'^^'^^ ctgtcattat 2280 
ccatgatgat gtggggtttg taacJac^a^ f ^ gaccagtcct cttccagctc 2340 

tgattcagcc gactgctcS ^lltlJl tgctggtact tgggccaagg ctgcaatcag 2400 
aggatttSga cgSgaS tScagaala aca?"''"^ ^ttcttgctt ttcaacgaga 2460 
attggatttc gttaalacac Saaa?™ T.^ T^^^l caattttcag atgccagtca 2520 
taaactcaat acagtggatg accaaaaaac TrZl t^^^ ttaggtatag ctgacgagac 2580 
cctgggtctg aagLg?caa gctcgttgga SatcSr ^^^^^^^^^^g tgggtccttc 2640 
tttgaatggg gatattcctt tcc£iatln ^ ^ accgcagttg ccgaggtgac 2700 

caatgagagc ?tcagagctg ccatcgacaa atc?ta^^^? ataatcagag gcaggggatg 2760 
tgatgaaggc atggagacc? tggSS caci^==^ aaacccgcgg tagatgatga 2820 
gtctgtatcc acagccagtg atcagcctfc ^lltll^T ^^"=^^3^^ cagggagaga 2880 
ccaagagaaa ggtgataaga ctgSaaaaa l^lr, l^^ ^agagacaaa tgaatggaaa 2940 
agatagagat algjagaagg atSaS^aaa ^lll actggaaaag aaaagaagaa 3000 

agacatgttc aggat?c2J ccaaaacSJ agaatttaoa HT^''^^^ agggcttggg 3060 
tgactatgct gaaattcaag attttcatca llllUllJ,^ gaacgacaag ctcgagagcg 3120 
tgggggagtt tcttcttatj aaggttccat ?ac?ctS?^ tgtgatgatg agttaatgta 3180 
agaagggcat atgatggatg ctttatltac llf I gctagacctc agagcccacg 3240 

ctcacctgta gacag?!acf gatcaacSc tagcaat^^t ^ttccaaacc 3300 

gcaagaattt cagcaagcaa agcaagatga aStgtJSaJ lll^lT'' ^^^^^ctgag 3360 
ttttgagcaa ccctggccga acgcacaacc lllllllll ggacctatag 3420 

cgtggaggtg caga?gcagc gjcagcggca Saaclar^o ^^"^^^^Srac actcggtgtc 3480 
gcgccagtac agctctctgc ctcwcaaaa llllll^^t gagagctccc agcaggccca 3540 
ctcttgggag cagaactact cccc?™ Za^l^ gccagctcgg tctcccagga 3600 
gtactccagc taccaaggct ccaSaacaa Itl^ll ^ agtgccaaag agaaccccag 3660 
ggtcatgctg gaaactcagg JgSccttoS iT^Zlll^^ ggacatggct tcaacgccag 3720 
gaagaagcag cctccttccg SgcccaJ caac?ata^o aggcggaagg agcagcagat 3780 
ccccagttac gcccctccca agggS?? ccaaca^a?^ ^cgtataaga aagtccagga 3840 
ggttgcgagg ctgaacagac ttcaaactc; till ^ gtgcccccct ccccttctca 3900 
acgcaaataa cgjatgc^tc atgtcacaca J-llT^^^ aggcccttct attcctgagc 3960 
ttttgggagt tllttl.ell cctcSSgt aSatafaat UllTll^ aagacttgta 4020 
tgcctttaag cggtgtaggc aaaaaaataa Unllll^ atttctgttg ttggtatcag 4080 
aagtgtctgt ttca?gg^^g g^t^tcccac ^cgSc^a If^l^ -ctatgtctc 4140 
tatgctctgc gcctctccac agtaccaaaa ^Itll^nl^ tcatctgttt gaggtgttca 4200 
cttggttgta acatccctgc accacttjca l^lltlT.l ^^^^^^^^gag ttgtccgcgg 4260 
gtgcggccct gtttctccct ctaagttctc ?taactataa llT'^^^l^'' ggaggatgac 4320 
tggaggaaaa gtgggcgaca tacacSaaa SggggS If^T T. ^^^'^^'^'^^gg 4380 
ccatttgtcg tactttgtca tcactgtggt Stc?c??tc ctS^^^f cacagcacag 4440 
gggagagtcg gtcatcctat tacaoaaact lllnlll . ''*''^^^tcagc tctttgtgac 4500 
tgggggtcct tttgtaactg cctSafct lltt^l^ ccaacattgt ttggtcacca 4560 
ctgcgtttat acaEacgct? gt"gScct attcch^r" ^^^aaagtga tgatcctggt 4620 
ctctgtgtgg ggaacg^ggg Sacfggagt ggctcctgcc SSaaS ^^^^^^^^^^^ ^^^^ 
ttggccctgt gtctatcatg aggacaaaar Tllllll gggggaagct gggcctgcca 4740 
tggcccagta catcctagta a??a?t?Sa ttaatt^f^ ^-^^^g gaagagctca 4800 

gggcacagaa tgggacagag ataaaccaga ciaSt^S f ^f^'^ ttgtcaggaa 4860 

tcaagtcaga ggcaattgat gcttgtctfa lllltllll aatctgctct ctacggtttt 4920 
atgccacgct cctaagtagt ^ctgccaSa 2ca^™^ cactgcatgt ctgactggcg 4980 
catcacacag agaacatttt cgggtcccic agcaataat^ acaaaggaaa agccgttaca 5040 
tcagtattag agtgtgtgtg tggStctcaa llllll^^ I gcaggaagct cactctcgcg 5100 
gttctgaaca tccEgEa?tg tSaccataa ^llll^^l ggctcccatc ttccttcatt 5160 
tgtggaaaaa gctggaggtg aSgctSg a^ScatS tSStS ^^^^^^^^^ ^^20 
aaagacatgt atggatatgt ctattththt- tlTZll tttactttaa aaacagaaaa 5280 
ttgtttttag aaifgatg^g Saca™ ll^llfll^ cattgtattt ttgtgttgac 5340 
tggttctgtt tcttaatcac gtgcScgS gtctaaaSo "^^^'^'^^^^'^ ttctgtgtca 5400 
tgaccttgga tgacagtggc tctttcfo^^ ll T^^^ tgttaccagt gtacgcgcag 5460 

ctctgtacat a?gcaL?^c taaSa^gg cStttctt lllT^lr ^^^^^^"^ ^^^O 
aaaaa t-atacccctt cctgttaaaa aaaaaaaaaa 5580 
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<210> 36 

<211> 2667 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7502420CB1 

<400> 36 

t£a"tc'S SSiiSS SSSc Tct'TaT' f "^^^^^^ ^t,t,,,.ct SO 
aggtctgaga cctScgaac? acaactcccg Saaaca?ca ^^^"^^^^^^ cagcccctgc 120 
gggcaaccaa tcgtaccaga gcacrqacaaa rc-Z^ 2 gaaggccgcg tcgtaattgc 180 
cgccaatggc agagctccca ?cgcagagcc tgSgg^taa alfT"^^^ ^^-ggcctc 240 
ggcggggagg agccaagggg gcgagcaaac tlaatl^n^^ Sttgtgaggc ccgggccggg 300 
gcgcccttca ttgaagcggc aataar™ ^^SfSTt^Srctg ggtgggttgg ggcgttccgc 3 60 
ggctggaaaa tgccagtcca ^^cccgagga Stfa^f Sftagtggaaa gcgacggcgc 420 
atggaggtgg actacgccga aaatgaaaaa ^a^tn^t^^ agaccaacca tcacgatgag 480 
tcgtcggtct ccgaggatgj agaSgSca ItllfTj^ aggacgagga cactgagagc 540 
agaatggaat gtttggatga aatgtccaat cttgallaS aa?^^^"''^ tgaaagaaga 600 
caactttata aagaacgatt aagEcaggtg gScaaaac '^'^^^^^^S^t 660 

aaagcaccag aatacttgga accoctaoci tnlf^ tacaagaagt catagctgga 720 

aaggtagcag gaatctaS agagctcS ttaa^atc?^ aaaatatgca aattcgtaca 780 
gaaattcaag cttctcgcca gca?tataaa ^nrnt ^ taaagaacaa atatgaatgt 840 
cagagtgaac tagaggagaJ gataSagg cttgaaaaa^ ^"^'^tata tgatacagtc 900 
acctcagagc tgtggaatga tgagc?tcaa tcaaa^ff?^ ataggcacag cattgatatt 960 
cctgacaaaa agaagccagt tgt?g"tca aatocah^h» agaggaagga tcctttcagt 1020 
cttgatattc ttgaagactg gacaacaat^ ™ tagtttatat gctacaagat 1080 

agagtgaaaa cggaaccacc Stjaaac^g Xaaafca^^ tggctacatt ggggccacac 1140 
gagggaagac tatattatga tSSJaataa ttt^^^ t *^Sfcacagtgc tagatctgaa 1200 
aaaaaagatg aatgtcctlS aaSg^ta aScaaSf f?^^^^^^^^ atgtattgat 1260 
tttaagaggc ctgatggaag caaSctSg ctttacaf ^^^^'^'^f ga tgaagtttgg 1320 
tattcaatta aacattcati atcatgatS Statta^. .^""^f^l^"^ gaaaggaaaa 1380 
taccaaatgt aagtgccatg agagtSaaa aa?afa^^o '^^^^tttacc ttattagtgt 1440 
aatcatgaga gaatgtgta? tgSggtag tacStaaat IcT^.^'T ^"'^tcactg 1500 
taaaagaaac agtaataaaa attttatcac aa^rn^^^ ^^^^f^^^^tt gatatgttat 1560 
gatgaccaat aggtattctg taJatggtS gaattt^t^? ^tgatttgcc tcttaggtcc 1620 
ttaaaaaaag ttatgcaaat ttgtcttato Ittlatlltn f ^^^^^^^tt ttctttggtt 1680 
atttttaaaa ttttccatat ct?tgtca^t c^tta^^^^^ tatgactaca tttatctgca 1740 
atgtttccta taaatggttt gtaSaSJ att^aSf ttgtaaataa ggccgataga 1800 
catatatata tatgaggatg LtaSggc atcScaact ?a^^f^^^°'' ^aaatttaaa 1860 
accttacaat cttgttttac ccaaaat-?3« 7^r.^tl^^ tatttagaac tgatggccat 1920 
ttttgtagaa tagcaacttt tctt?Stc ^ttcttaf^^ ^'^f ^^^^^ aaaggagcac 1980 
atttttacaa attataccta atgagtaSa ttaa^^^ ^tatggtggg gtggtgacct 2040 
gtatttctag tgacccttta gcggSgS tttatacc^a ^.'^f,^?*^^^ gcttctacct 2100 
aagtggtgaa caataactga cagSaSta cfc^So?^^=^ gtatttatga tgcagtatat 2160 
gattttagta agcattttcc agaggtaaaf ctSa^.^f^ atgtctggtc ttttgaaaca 2220 
gcaaagtaga cagggattat t?ccttgaat cUttlr^ll attctaattt tattcctagg 2280 
tatttctaca ctttaaggcc atttgg?S StSaaaa ttttctttgg 2340 

gccacattca aaattaactt ccaaaarr^o ^"^'^^^^^S tgttggcctc ccttccgcta 2400 
tatggcagca cagccggc^g Stgtatat tf^.nn^t^^^ caaagaattg aaaccctcaa 2460 
ggtcttgtgc actttaatt? cStScaat lllff^^"^ ccaaatcagg tattcctggt 2520 
atttaacact tactgcagaa a^g^c^Scat llSlalftf Sratgaggaag aaatctactt 2580 
ctccaaaatc ttcctgtjca gaSacf ^attccgttt gttttcttat tattttacct 2640 



<210> 37 

<211> 1559 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> inisc_feature 
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<223> Incyte ID No: 7506455CB1 
<400> 37 

S:SttS SScaS? iTairf 'T""'''"' .tgcgagctg 60 

agacggacac tgggtggaaa a^aaaa^^^? gttgctgctc agtctgtaac ggcaggtggg 120 
agccgaatgc agagSa agaS^tn ^r-T^^,^^^ tgggagggct gattttcgag 180 
aggaagaaag gagSc??S Saaaa?Sca Itntt^ ^^^ ^cctgattac attccaacag 240 
ctttggctgc Lcagtatg ?Sttactc aSSaa? S^f agatctgtgc 300 

gtcatcccaa atatggttcc atccctaaa^ tt^f It ^f^^^^^^^^a atactttcaa 360 
ctggaaaact ttcttatata ^Ittn^t ^tatacttgc ttgtatcatg ggatactttg 420 
ccc£tggaga a^ctttacga tcaaSca^a aagagaaatt caagaaactt gaaaattccc 480 
atcaalagtc aaaatatgic tcllatatnt ^t^^'^^^l'' ttcaccacct gggcactatt 540 
cagcagacaa cataaaaata .^f^^^^ gtggtcaatc atcttttgtg acatccccag 600 

atSaa^ctgc tcccactgg attacStc lllttTtT.l f^^^T ^^^^-^^^^^ ^60 
gagatacttg ggatgagtga aaaattarah ? agtcaaagta aacaagtatg 720 

cttcatctag gtgg^cataa tttrnl^^^^ cattggacat gaaggagttt caacatccag 780 
acatttaaaa cagatcc^g gSttSata atll^^^ff cagcagcagt cttcataaac 840 
cagattttta gtgttaata? tatlltlfl^ Stttgacttc tatggtgttt taaaaaaaca 900 
gtattatacc atg^tSat acaatttat? ' agggattcat ttgaatgatg 960 

aaaaaccgtc gagattfcaa tactlllaJ aaattgttgc aagggcaaag ataactctta 1020 
atgttgaftt gggg^ggatj Jjgggagc" SStt^t taaa^.'""" ^^^^^^-^^^ 1080 
aagaaattat taaaaagctt tttt?c?cca ataSh^^ ^^^f f gctttgcatc 1140 
caaataaaat gtaaaggaac atgcaacaar tattatctta atgtttatgg 1200 

gtttgtaatt LttgStta tttttaaact aS^^!^^^'' cttgatttta tggttttatt 1260 
gtcatctata atatttctit alllranntt ^^'^^^^aatc atgtaggtat aggaaataaa 1320 
gaggcagtgg gcagtggttc atgc^ataa tlT^^^^l f ^^^^ctat aatatggctg 1380 
gattgcatga gcccaggagt SgaaSaa ^o^S^^''^'''' ttgggaggcc gaagtgggag 1440 
caaaaaaaat taaagSaJ^ i~g^ c^SgcSt ^cSS^ StLT Ts's 
<210> 38 
<211> 645 
<212> UNA 
<213> Homo sapiens 



<220> 

<221> inisc_feature 

<223> Incyte ID No: 7506018CB1 

<400> 38 



tc^Sgca'g? gcagScS cSccgSc c^^^f agcacggcgg cggcgtcgtc 60 
cagcatgagc ggcctg?gca tet^^^^^f '=^*^=^9cccg ccggcccgtc tgtctacccc 120 
gc?cttcg?g gaggc?g?gg ITcttTrTr. ^'^^Srgtcacc ggctcccgcg aagactatga 180 
caagcctgtc cagagttS cStaalct T.T^^^^^ ttcctgaagc tggcttgagt 240 
ggccatgLg gaccttttga ccaactccct Stcattcc^ ^^^^^^^^'^^ gccttcacct 300 
ccccaatgca ggccacttct cctr^nt^^f gtcattccta acctaacctt agagtccctc 360 
aaggcaacat SctScca ttagtctcag aaattat^^^ cctccatcta 420 

ggctgccccc agccaagcat tggggccScc atcJh^n^f aagcaacagc cccaaatgct 480 
ctgttggttc catcagccag litllT^Tcl Sgccccgc agtccctcS "'^^^^^^^'^ ^40 
cctagaggca attaaatgat gtcctg^tcc at^ggcaaS 2aS --^^aggac 600 

<210> 39 

<211> 1824 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 353017CB1 

<400> 39 

cctttattct gacaaagaga tgacaacccc agccaagcag attaagactg aaagtgaata 60 
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SS?S^,' SSS '^?f 

c^sss" :-:c^s -es -° 

gtcctcctag aagactacca at-r.nn^==ff cagcattttc ccagtcccct gaactgccca 3 00 
aattaaaccg tSatataS cacatccatc ^^^f caagcaaaga aatttgtatg 360 

atatcagtga agtttctgja a?tatttrff ctgagttaag aaaaaactta gaaaaagatt 420 
cagatgtgca aSagcSS? tatatt^S^ gtcaaatgaa ctcagggagt tcagtctcag 480 
gaactcajaa agajaatac? toaSt^a^^ f^^'^^'rf^^^ gacagttctc aaaaagatct 540 

tagatgg^tc t^tSga^tc aaSSS Talllclli Zl'Tclaf' Tt^^"''' 
gtgatatttc caggggcatt gctgatcaaa ^^'^^^'^'^tc ctgtatgaag 660 

ccacatggat gtttgaaacc cllatrntn^ aaatcattgc tggtggcgat gtgaaatata 720 
tagaaaa?gc agagaaaatt ccllTaalnn ^^^^^^^tgg ggctcattct tctgacactg 780 
tgbttgaaac afggccaSg ga^caaSa allT^lT. J^'^^*^^^^'^- ^ctcggtgga 840 
cggtaactat caataaaa^r zt^l^t ataaaatgca tcaaagtcaa gaagaatcag 900 
aaactcaaca tSg^^caa cttaSSa^ ^fftlf'^t ^^""^^^^^^^ tacatgtttg 960 
agcttaggtc tgagctcSa aaal^ta^f^ ^''^5*'??^^ ^gatgaggtt catttactgc 1020 
aaactcaacc a?tl?atgtt atltaaatta att^l^^^^ aagaagtata aaatgtttcg 1080 
ttcacagaga agacgttjaa aaaaaa^^^^ Sttcgggcca aatgctggaa attaaaactg 1140 
agccgt?gga caStSc ^SStca SgSttIa TatfT^'^ tttgaaacac 1200 
tggaagaaaa tgtcaaaggt ggggtaaata t ^^^^S^ccga ggaatatcca 1260 

tggagaaaat caaagag?ca gaagaaaSa tr^Jf^^^''^ gttatttgaa acccaacctt 1320 
atgtctccag aaag?g?tgg Itaftfaltt Ir- ^^^^tt ^^^^^^^^^a ataggtacag 1380 
ctgatgcaga ttc?c?acaa cSaaaaaa tttt^'"'^'''' agacattcta aaagaagttc 1440 
atctatttga aacacttcca aHallla.t ^^^''^^Srtgg tgatgtacaa actactaagc 1500 
aaaaaaatca ctgcc^ct^a agaagaaaaa a™?^?f^ tcctgatata ggaaagcttc 1560 
gaaacccaac ctctggaaga SSSaaaaa ^rgcatcaaaa atggattttt 1620 

cttgaagaaa gttgacagag allatatat^ ^^^^^^ agtacacacg aacagtgaaa 1680 
atttcattaa atttgataca f^attacaca catatctttt gaatcaaaca 1740 

taacagttaJ ?taaKc?ct ct?c ^^^^^^^^^^ aaggagttac agaaggtgtg 1800 



<210> 40 

<211> 8043 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 5137816CB1 

<400> 40 



cllli:T.l ttSSg :Sc?g:S g'ca^S^^^ ^^^"^^^^^ ^^^^^ - 
tctggctgcc taaactgg?a gaccccc^S attactccf^ cccttccttc cgaggagctt 120 
tcctcttctc ttagttctct ccgcetc^n^ attactcctc catctccgct ctctttcgcc 180 
cctccggcta tccgctSctc cttrnt^t ^^^''^'^tacc accacctcca gtcagtctcg 240 
ggagagcccg gacggagSg cTaaaaaaoa ^^"^^^^"'^'^^ tgcctgtccg ctgccaccaa 300 
ccatccctgg ccgc^gaccc aaal™?^ agcagccggg agttggggct tcccccctgc 360 
gtcttctt?g ccttcagaga Ictarttt^^ ccacttgagc gagcagagag tcgtcacctt 420 
ttttgtctcc ttagaaSS ggc^aaaafa ^ftf'^^T ^^^^agcaga gtgaaagagc 480 
cggcgcttac cctgaacctg aatStaa^^ ctaaagagtg accagtggtt ggactcttgg 540 
gctgctcata agcIagSt cccStcSa Itrt^Ttt f ^^-^'^tgc ggtttctgga 600 
aataggtcaa gggcagccca ggtgcccaa? ttc^ctc?^^ '^ffSrcctaaa agaaagcaga 660 
aatagagatg agaaattctg aaaaacao^^ cttcacaagg caccaccagc 720 

tacaagagca gct^cgcta? Jgcaatccte ataaalT.^ accacggttg cagcgtttgc 780 
agcaagccac agggaatggc cctcctttr^ fftgagaatcg cagcctgctt gccatacacc 840 
gtgatgtgtt gag?c?c?aa aScfJn^^^ ^cagtggcag tgggaacccg ggccctcaga 900 
aacccca^g ?caggaa2c cagtcaaaa^ aacagcttgt ggctcatgct gctcgacaag 960 
gaatgca^a LSSa ctcccglcct ataaaLf' ^^^^^^^^^^ ctgtctcctc 1020 
ttcggggcca acagcatacc tltnrf^^^ atgaagaagc caaggtccag tcccagtact 1080 
agaagatgag Jac^gaggga cgcccScao ^^^^^"^^f tgtcactgga gtcaccaacc 1140 
aagatgaggg Ltclgagac cttaaa™ l^^^fT'''' -^^^^cctgga aagatgcacc 1200 
.gcagatgtc actgglS^ Zl^l^^l g^^SS JS^SS 
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aggcccSg gScttc^t TalllTolT ^^^''^^'^"^ aagttccagt gaattctaca 1380 

caLagaa?^ tcc^ttSj ggcScca^ cccSS ZlTaair ^^'^ 
caaaacac^^ r^a^^,rr^r,=„ ^ cccaacctgt agtgtgcaag ccccaagagc 1500 

tSggtatca gcatccccc? aaaSta^f accagccagg gagaacagag gggcaactga 1560 

c?= ~ ii i= ii^s 

agatcctctc agacgagaac cgSaacttaa Z^llT cagagcccag cagatggttg 2040 

s^sss r F™ -ss's "°o° 

~? s iii ~ s-ss 

aactctftnr. gaatatgagg ggtcagagga oaccagaaaa accatctcgc 2340 

cSctgccSg KStaccSt SaSccafa f ^^^^^^^^ gaagctggaa gcggagctS 24? 
tgagtaatgc cclggccaag g???taaaac faaf ''^''^^ cgaaatccga gatcaggccc 2460 

iiiSEEilliiiiiiiSipii 
iiiiiiliHssiiiiigi 

~ i™ 3o'^s 
=s ? ■ ™?e 3\i„° ■ 

SK^g S=S' t"""'= 

cgss i^g^? r-F ---- 

ctccagctgt ggctcaacrct (racr^n-^^^^^ ccgcggcccc aactccagct ccaactccaa 3780 
gtttgtctat accaaa?tS aonf^^f caagtccagc taccggtcct ggaccacatc 3 840 
ccaaLc^ct ggaaa£2a actcccattc laT.'cT"'' '^^^^^^^^^ gtgttccact 3900 
tggtggaata tctcatctaa agatcctggg acaagagcct gatgcagaga 3960 

t£?aa?catt JtccScttt ta?S??St TatfT^^"" agattatcag caaaaatgct 4020 
gaggggatgt tttttaacaa ;ar^^^^^^ tStttt^agg gtgaggacaa gggttgtggg 4080 
gtglacacca gtctatttta tggaacaatg tactacttga gtaataccat 4140 

gfes i iF? SSS tlTo 

aaagcactca cctcaggct? acaaaaaS^^ f^^??^*'^''*' atcctgtgoa gcagctggag 4440 

sgsL^? s^S^j f F'F 

aacttcc^nh ^ ctttcagatg ctgggagagt acattaactg agtgatgcgc 4740 

gctaStgca gact't'^S^g gtttSSSt' a'^fT'^" ^^tggagag? t|cSS? IsSS 
aagaccaltc Igcttagtg? tS^?tS?^ f^^^^*^^^^'^ acagcttgct tgcatttagt 4860 
gtctcaaatt acllltlltl III tacagcacag attcactggc tgggtctcoa 4920 

?tgaatttct ggtgact^ct Sgaaatalc ^^ttlTtl gttgaatgct 4980 

tcttaaaagg catEtttaac lltlllll ttgttttgtt tgtcaaattc taagcatatg 5040 
aagg catttttgac tatcacctcc aagggaatag cttgagaaac ccaaagtact 5100 



6 3/89 



wo 03/050253 

PCT/US02/39133 

SSca?c llTtlZT. ITcTaTr' '^-l^-^ootc ttctcactgt 5160 

tcaaaggagg caatggctcc llalllfan. ^ff^^cagga aatgtgcatg tgggtagaga 5220 
gccagttgcc acagSccct gccctatatc TrTf .^'' ctccaaggac acagaaggat 5280 
ctgcagatag catcattaag atcag^ttta aSaaSaS. ccttcattaa ggtgagaaat 5340 
ggtgttaggt aagggttggg ggtaSqStt tllanltlt^ Srgagggtgag ggaagtgggg 5400 
tagaagagta ggcctgatat a?tS^l^ ttgggatgtc ttagttagaa accagattaa 54S0 
atatagccct acctcctca^ ?^taa^™f^ gagccatagt ggtgggaaag aactttagca 5520 
ctttttgaaa gtc^tSSc ^caactS^c tffT^''^^ gaactggaga ggttcagtga 5580 
aactctttat ctttt?taJa ttr^«^^^^ f ^^^gccaa tcaccatgct tattttggga 5640 
tgtatcttgt aelllgtlca tlaltlalat Sf^T^" tcttcatggt ccagggaata 5700 
ttctgtcttg cctStccttc aa?aS?.?^ ^^'^^^'^^'^^ gtcttgacaa actactgaat 5760 

caacftaga^ tcacat^aa? gSccc ga g^^Saatc ccacttctt? T''""^''' '''' 
gggtatctta gataatcttt tggggttcct c??taaata? ^ ctaggagatt 5880 

ttatcttcca aaggaattat tttatatltl t^^lf f t t^^^^^^^ta tccttcctaa 5940 
ggtgatgtga tt^^cagcta StccStc^c aSttcafS !f ^^T^^ cataaaggag 6000 
acacattagt cctcaattat ^^Zrri^ll t ^^^^^'^^^^ atgaggtaat catgttacat 6060 
aagactct?t catacSccc tttaS^«^« tcagatagaa attaaaagtg cagttgtgtt 6120 
agcaagtagc aaacScS agagctaSt ttlT.T" ^'f^''^""" tagtagaagc 6180 
acctactcct ttcataatta Ittn^n^tl '^'^^^^'^^^^^^ taatttaaaa aacacttttc 6240 
attttaatca clltalTcll ggtgacaa^? llcltT.Hl ^^^^^^^^-^ gggagccatg 6300 
ctctttagac tcatgtgttt SgSaaaaa fa.^^ ^f ^ ccacattgtg tggtagccat 6360 
cttitccagaa ccctctSca ccagagacia ttcthS^f tggctgagta ctattttgaa 6420 
aatttgctaa aataacatt? tacatccaaa aa™= ^ ttcagtttcc aatccccgat 6480 
gcagaagata gatacagcc? JttcSga Sccactaa TtSalf^ ^^^^^^attat 6540 
gtttatggga gaaatctcaa ttagccacct ttlrtnt^ff tttttgttcc aaaaattcaa 6600 
ggatctttct ggactcctac cJa?ctgtgc atttta™ ^^gtggatat aacatttggg 6660 
accaggttgt cttagcttgt actgc^tggt Tatclctala ^^^^^^f^^ aaggagggtg 6720 
ccccagtgtt ccatgtataa aaaaarhhn^ ^^^''^^'^^Srag gaccttctaa ttcagttgta 6780 
ctattaacag tctt^ttttg aaataS^a^ ll^^tt''^^^ ctttacttga tatgaaactc 6840 
cttgattgtt gatttaatgl Italtttrnf ^f?""^^^'^ tttcttttaa aatcatgtat 6900 

aggacagLg ^aacct^aS S^ctgccS catct^a? S^aa'Sta ^^^^^^^^^ 
catcatgttc tctttttatt cr*-ha^a^^^*r ^"^"^^gccac aggaaagata cccagagacc 7020 
aactgattga t?gtgc2c Sctttttlt S^'^^^^^" taacaattgg aaaatgaaca 7080 
ttaaagtgia taca?atgcS cSctttga ^ca^t^atat tTTt^t ^'^^<=Srgcagc 7140 
cagtctccta ttatctcaat tgcttatata rtttl t tctcatatct gttagctaca 7200 
attcttactg aaaatgSt caaaaa™ ^^^^ ^^^^'^'^^^tta aaacgtgtgc 7260 ■ 

cagcctgcca cgattcttgg aaata^gtc? tccaaatacc IfftT^l^ ccaccaattt 7320 
gtgtcgggag tttgtttatt t-i-hhr^^=^^ tccaagtgcc atccatcatc agtaggacaa 7380 
cctcc???tg gccSccttg tja^taataa T.Tftl^ ggttacatgg agccatgaaa 7440 
ggcagatggc tagagaagaf Sgcctttaa tttSf^J ? taaaatggat agctggggtt 7500 
tgaccccatt ctgtLtcaa ctSaactaat Inl^t^^ atgtggtccc ctaatgattg 7560 
tttgtgccta tcttttcact g^SaataaS Itla.ltttf "^^^^^^^^ ttaaatccac 7620 
aaaatgcttt ggtgtttacc ^gaSagaggc lltTttt^ll ctat^^^''^? taccgtgtgg 7680 
aattcagtat gcaataaaat ^^r.n^^^^= cccatttgtg ctgtatcagt gattactttc 7740 
tgaagtatgg lagtgatS taSStS K?aSf ""^'^^^^^^^ tgcatttgct 7800 
tctaatcggt ctctttatat acSgJcag c2a^^?taa ggccgcatgc 7860 

ttctggtttc tagaaataaa atcr^tr,^Zr '=^'^®*^attaa aagctttgca tctttttatc 7920 

catt??ggaa ggc^S^ga t^^tcc^ttt gtacc^ttat tTT^^"" -tttggttag 7980 
ttt >-acccccttt gtacccttgt tctgatgcca attcatcagt 8040 

8043 

<210> 41 

<211> 1550 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7502151CB1 

<220> 

<221> unsure 

<222> (1) ... (1550) 
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<223> a, t, c, g, or other 
<400> 41 

gcStSac gagt?taftt iTatlTtll ^Scaggtttt tagatcctca 60 

ctctgaccct Sgc^cagc ccgagSac a^^?f'^°?? ccagaagctt tcaccctgcc 120 
aactagggcc c^StcaSt cgccSgaa JcccSaS^ "^^^^^'^^^^ ggcgagaagc 180 
ggccagggcc gctcgcccaa cctgSc tgcctcttca aT.T''''^^ ggcagagcca 240 
ccctgcggct acggagacaa acaaa^^^^^ tgcctcttcg gccatggaag ctgccggcag 3 00 
gcgcaaScag cgSSgS aagaaaa^ta ^a^f ''^^'^^ cagcgacctg acgggacctg 360 
aaacaagtat g£gaS?t?t tcSSaS^ J.^f '^''^f^ gaggaggtcc cagtatatga 420 
ggccactgct ccSgtcac" catccaaSo h^f^^?"? cccccagggc taagccctga 480 
agccaaacgt aacctgaagc oaaaaaaa^^ tgaaggtggt gaaccaggcc tctccaagac 540 
agaggccttg agcaggactc ?tga?aagga TtllafT^^^ cagcaagaga aaggagaggc 600 
tgctccacag ggctc?cggg cagcccccS iTatallf^t gagacagccc aactccccag 660 
caccactgag aaagccaaga aaataasS«« ^^f gaccagcctg actcagctgc 720 
gctgcagcag cggatccagg Sgjaaag? caaccaf aaactccggc aggtggaaga 780 
gctagcaagg aggagggcgc taaafaa^S ^f^^ agcaaagagc agctagaaaa 840 
cctttgggga a^aggglS JactgSSa cHltn^f^'' ttggagttag gcctctgagg 900 
aatacgggca acagSgtca ggagSSac cccccSc? laT.fT.^ ^^tctggggg 960 
ggcccagctc tgtoctccag Scctajcat Italrtt^tl ^^^"^^"c cacctcctgc 1020 
ttctactttt tggactttcc ccctccca?^ ccttcccttt tcttcccaac 1080 

aggtaccttt gcEgctgatt tSgtgtctt atttL^ aaaatctcag tgactacccc 1140 
tcttggagaa ctgaggctga ggg?aaaaaa .l,^t^^ aaatcaggtg ggtgggaatc 1200 
tgttcgcggt gt?tf?gg?t iTtlTctll ?Stccctc a?tt^St? ^^S?f 
aaggcaaaat tgagaataaa cacaagtaga cttatctaa^ ttttttaaaa 1320 

cgcgccagta tagggagctc gaaaccgaaa allfl^lft ^^^^^^^^^^ aaaggggggg 13 80 
ggggggagga aggajtccaa ^a^acSgSa KgagSag acaa^^?^^^ acctggaggg 1440 
ggtaccaaat acggcaaaaa gggagg^S SgSSc ISSggcSS ^^^^^^'^^^^ J^^J 

<210> 42 

<211> 2108 

<212> DMA 

<213> Homo sapiens 



<220> 

<221> inisc_feature 

<223> Incyte ID No: 7505983CB1 

<400> 42 



ctScf ItlTcllTcl ?tctS iTaTcaT ^° 
agcgggggcc aggccgaaag gcccggaacc lalta^^t ^"^9^"=^^^ acgaaggaga 120 
tcttgcctgc agtaagtgac allllft^n^ agaagggtgc cgagacagaa ctcgtcagat 180 
cagccaagag gagat?g?Sc ???gttoaaa cco^f^^"" tagtcgtgct cgaaagaggg 240 
aaccattgcc EgJaaagS ccaaJaaaaa ,rtnt ^^^"^ aaataagtct cctgaggcca 300 
agaagggacc ccagtcccta tttaatgc?c ctcaiaarf ^^^'^^^^^^ST acagctggta 360 
gtgatgagga agaggaggag ttcttagaaa ctlttl^ gaagcgccca gcacctggca 420 
ggaccaatac cttgLaacc cgacacccaa Tr-^tl^^^ gcctcggccc gtcaccctcc 480 
ttaacctgga tcccctgggc aagtgatcaa ^^f^^^^^^^tc aatcgtgggg 540 

tgcccattgg tgctaccccc aaaSS^oS agactggact agtggtgtat gattcttctg 600 
gcatgttgcc cgtcaScc Sggcacccc agafacatS "^*^^°^^=^Sr ggagcctcca 660 
gtgcccctgg aggaaagacc agctacatgg cccagctgat allT. ''''^ ^acatgtgtt 720 
ttgccaatga cgccaatgct gagcggctca lal^alfnr ^^^^^^'^f^S ggtgtgatcc 780 
gagtcaccaa caccattatc agcSctS ^ gggcaacttg catcggctgg 840 

gctttgaccg agtactgctg gSctccc? aclato^of Srttccccaag gtggtggggg 900 
cagccgtgaa aactaacaa^ gcagtggcac tggggtcatc tccaaggatc 960 

ag?tg?tcc? glTtgotltl T^TcT.ttt IZlT.aT.^ -^tgtgctcac ctccaSagg 1020 
tttactgcac ctgttctatc Zrtntl^: ^''^^f^^^tc caagacagga ggctacctgg 1080 
aaaagaggaa tg^gcSctg gtgcccIcS gcctaaacS f^^^^^^^^^ tatgctctga 1140 

-f?ss Ei i ~c sss^^ j-s 

agtcccagae aggaa\^?tct Sac^^ca -L^^-- ^a^^S S^gS^ 1^^^° 
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tccccaagtc tgagaacagc agccagccag ccaagaaagc caagggggct gcaaaaacaa 1 4dn 
agcagcagct gcagaaacag caacatccca agaaggcctc cttccfgaag ctgSggca 150Q 
tctccaaagg ggcagactca gaattgtcca ctgtaccttc tgtcacaaag acccaagctt 1560 
cctccagctt ccaggatagc agtcagccag ctggaaaagc cgaagggatc agggagSa leS 
aggtgactgg gaagctaaag caacgatcac ctaaattaca g£cc?ccaag aaagt^gct? JssS 
tcctcaggca gaatgcccct cccaagggca cagacacaca Lcaccggc? gtg^tSccc ^74? 
catccaagac tcaggccacc ctgaaaccta aggaccatca tcagcccctt ggaaagacca 1800 
agggggttga gaagcagcag ttgccagagc agccttttga gaaagctgcc ??ccaaaaaS 186^ 
agaatgatac ccccaagggg cctcagcctc ccactgtgL Lccatccgt ^ccaSgcc JS^S 
ccccaccagc aaagaggaag aaatctcagt ccaggggcaa cagccagc?g ctSaSS JIsS 
agatggttga aaactagacg ggtggctcac tgccattgtc accaggttgg aaJtcttacc llln 
tctgSr ^^^^"^'^^'^ actgtgcata cccatgaaat ttaaScS ?ttS2cc llto 



tctgaaaa 

<210> 43 

<211> 524 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> ndsc_feature 

<223> Incyte ID No: 7505986CB1 

<400> 43 



2108 



cSteaaaS ttl^non^^^ ?f f^f =°sr gattgcacgt ggagggocgc ccgaagggca 60 
ctctcggaca ttaacccgca ttctgtacca tggggcgcaa gttggaccct acaaaaaaaa i ?n 
t?t?SSc lllllllat^ ^cccggaagc agaagggtgc cgagcaS ct^gtcSt 1 
ccttgcctgc agtaagtgac gaaaattcca agaggctgtc tagtcgtact caaLaaarrrr oln 
lllT^llll ?SaaS° tctgttgaag ccccLa^ac aaatS^ct cctgaSca 3 
aaccattgcc tggaaagcta ccaaaaggga tctctgcagg agctgtccag acaactCTa^a ^fin 
gSStc ItTatllT. ctcgaggc^^ gaag^gccc^ ttTa/cZ 

Ica?a??tta aSr^^n^f ^^^^^^^^^^ tctctactgt gcatacccat gaaatttaat 480 
acacatttta aaacctctga aaaaaaaaaa aaaaaaaaaa aagg 524 

<210> 44 

<211> 1987 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 3231075CB1 

<400> 44 

ttctgcttat catggttacc aacaaaatga ctgctgcctt tagaaaccct aataaa^;,^n fin 
aggtggcgac agacaaagtt gcagaaaagc tgagc?ctac tc?ctcatgg Kgaajaaca 120 
cagtatcgca tacagtcagt cagatggcca gtcaggtggc aagtccatc? ac??cStac llo 
ataccacatc ctcatctacc acactatcaa caccagccct ttcaccatct tccccatcac ITo 
agttgagtcc agacgtctta gaactcctgg ctaaactgga agaacagaat atattgttag 300 
ttatalaall ^^'^"'^^ agatctgtaa atgggtcaig aagaaacagt ggctcSctc 360 
tSggggaaa ^attotta^^ tctagcaacc tcagtcacct tgaagaagat Sttggittc llo 
cccggggaag aattgttaat gaatgggaag atgtacgcaa aaagaaggaa aaacaaotta 4Rn 
aggaacttgt tcataaaggg ataccccatc actttagagc aatagtttgg caac^Stat tTo 
c?S?aaaa ^^f attaaggatc agtattca?a actcctgSa atgacctcgc 600 

ftt^^tccga agggacattg ctagaactta ccctgaacac aactttttta 660 
aggaaaaaga tagccttgga caggaggttt tatttaatgt aatgaaggct tactctt^aa ^22 
tagatcgtga ggttggttac tgtcaaggaa gtgcttttat agt£gga?tg ttgcttatgc IBO 

gSaacttS IZlTaTal ^'^^^^^^-^ ttgttaaatt aatg^Lgat tSgaSSc 840 
gcgaactttt taaaccaagt atggcagaat tgggcctttg tatgtaccag tttaaatat-a qnn 
tgatacagga gcatcttcca gagctctttg tacattttca atcSagaS t^teatacct 960 
caatgtatgc atcatcctgg tttctgacta tctttcttac aacttttcca cllcalattl ?n?n 
t^gc^cS tttatgtctg agggtttaga aataSg't Sgtagga? 10 S 

tagcacttct tcagatgaat caggcagaac tgatgcaact tgacatggaa gggItg?Sc 1140 
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agcactttca aaaggtcatt ccacatcagt 
cagcttacca agtcaaatac aattcaaaaa 
caataaaaac gaaagaaatg gaagagcaag 
gacttttaaa acagcgcatc gagacattag 
aagattttgt gctacagcta gagaaggaat 
ctcagtgtgc attaaaagag atgcaggata 
cccttcctga tgagaataat attgcaaggc 
gagaagcaga agccattatg ggtttgaaag 
aacactggca gcgccactta gctcgtacta 
atgctatgaa tgagttacag gatgaactga 
aagcagaaat aagagaaata aaacaaagga 
ccgcagatgt ggagcctgca tatatgaagg 
gtatctggca tgtgatgagc tcgccacaaa 
atttacagtt tctactttcc tgacactctg 
aaaaaaa 

<210> 45 

<211> 2857 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> inisc_feature 

<223> Incyte ID No: 7503516CB1 



ttgatggtgt cccagacaag ctaatccaag 1200 
aaatgaaaaa gcttgaaaag gaatacacta 1260 
ttgaaattaa aaggttacgc acagaaaata 1320 
aaaaacataa atgcagttcc aactacaacg 13 80 
tggtccaagc ccgactgagt gaagctgagt 1440 
aagtcttgga tatagagaag aggaataact 1500 
ttcaggaaga actcattgct gtgaaactta 1560 
aacttagaca gcaagtcaag gatttagagg 1620 
ctgggagatg gaaagaccca cccaagaaaa 1680 
tgaccattcg acttagagaa gctgaaacac 1740 
tgatggaaat ggaaacacag ttggttgaat 1800 
gctgaccgta tactctctgc tccttccaga 1860 
catttagtga ttcgagtaag gctctggttg 1920 
ttgctggttt gaataaaaat ttcttcctga 1980 

1987 



<400> 45 

ctgccgggtg tgccgggtgt ccagcgaacc cctttcccaa accttcgggg agaagggagg 60 
tgggaggagg caaagaaact acaggcaggg agctggaagg gggggtgggg ggggcaggag 120 
acaagaaatc aagacaccag gcagcaggac acacacacac tcacatacac tcacacacat 180 
agagaccaac agatagacag ctacctaaag cctgaaagac tgacagcaac acagaaaaaa 240 
agaaacaggc agaaagagag acaaagacag aaatagaaac agactaacac acagagtcaa 300 
aaatacagag acagaaagac agggagaaag agaaacagaa aattagacac caaagacata 360 
cgaacaggga ggaaggccga ctgaaagaaa gacggagaag aggagagaga agccagggcc 420 
gagcgtgcca gcaggcggat ggagggcggc ctggtggagg aggagacgta gtggcctggg 480 
ctgagctggg tgggccggga gaagcgggtg cctcagagtg ggggtggggg catgggaggg 540 
gcaggcattc tgctgctgct gctggctggg gcgggggtgg tggtggcctg gagaccccca 600 
aagggaaagt gtcccctgcg ctgctcctgc tctaaagaca gcgccctgtg tgagggctcc 660 
ccggacctgo ccgtcagctt ctctccgacc ctgctgtcac tctcactcgt caggacggga 720 
gtcacccagc tgaaggccgg cagcttcctg agaattccgt ctctgcacct gctcctcttc 780 
acctccaact ccttctccgt gattgaggac gafcgcatttg cgggcctgtc ccacctgcag 840 
tacctgtgag tgaggccagg ggatggaggc agcttcatcg aggacaatga gattggctcc 900 
atctctaaga atgccctcag aggacttcgc tcgcttacac acctaagcct ggccaataac 960 
catctggaga ccctccccag attcctgttc cgaggcctgg acacccttac tcacgtggac 1020 
ctccgcggga acccgttcca gtgtgactgc cgcgtcctct ggctcctgca gtggatgccc 1080 
accgtgaatg ccagcgtggg gaccggcgcc tgtgcgggcc ccgcctccct gagccacatg 1140 
cagctccacc acctcgaccc caagactttc aagtgcagag ccatagagct gtcctggttc 1200 
cagacggtgg gggagtcggc actgagcgta gagcccttct cctaccaagg ggagcctcac 1260 
attgtgctgg cacagccctt cgccggccgc tgcctgattc tctcctggga ctacagcctg 1320 
cagcgcttcc ggcccgagga agagctgccc gcggcctccg tggtgtcctg caagccactg 13 80 
gtgctgggcc ccagcctctt cgtgctggct gcccgcctgt gggggggctc acagctgtgg 1440 
gcccggccca gtcccggcct gcgcctggcc ccaacgcaga ccctggcccc gcggcggctg 1500 
ctgcggccca atgacgccga gctcctgtgg ctggaagggc aaccctgctt cgtggtggcc 1560 
gatgcctcca aggcgggcag caccacgctg ctgtgccgcg acgggcccgg cttttacccg 1620 
caccagagcc tgcacgcctg gcaccgggac acggacgctg aggccctgga gctggacggc 1680 
cggccccacc tgctgctggc ctcggcttcc cagcggcccg tgctcttcca ctggaccggt 1740 
ggccgcttcg agagacgcac ggacatcccc gaggccgagg atgtctatgc cacacgccac 1800 
ttccaggctg gtggggacgt gttcctgtgc ctcacacgct acattgggga ctccatggtc 1860 
atgcgctggg acggctccat gtttcgtctg ctgcagcaac ttccctcgcg cggtgcccac 1920 
gtcttccagc cactgctcat cgccagggac cagctggcca tcctaggcag cgacttcgcc 1980 
ttcagccagg tcctccgcct tgagcctgac aagggggctc ctggagccac tgcaggagct 2040 
ggggcctccg gccctggtgg ccccccgtgc ctttgcccac atcactatgg ccggcagacg 2100 
cttcctcttt gctgcttgct ttaagggccc cacacagatc taccagcatc acgagatcga 2160 
cctcagtgcc tgagaccacc aacgggactc tgggcatggc tggggcccct ggacggcccc 2220 



67/89 



wo 03/050253 



PCT/US02/39133 



ttggctggct cctggcccta cttggggtga 
cacgttcatc agccacacgt ctaggcctta 
gggggacata gagggtccca gcctactgga 
acatctggac atctcggctg gggcagattc 
tggtgcactg catggagtcg aggtggaggt 
ccccctctca acccgctcct accccgctgg 
attcctttcc cattctaaac tccacgcagc 
gaagacaact cagcactggg cgggttcccc 
ccagcaagct gtgccttttg cgcctcttgc 
gaggtgtcgc ggggagcctg gtgcccttct 
gtccgagctc aataaatgcg ctgtgcaccc 

<210> 46 

<211> 2375 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7506179CB1 



tggcGcgcct gtgagctgct gaccgtgggc 2280 
agcccacttc ttaaaggatc tgcacccatg 2340 
cctcccgagc tgcccttcag gtctaaagca 2400 
cccatcaaag gcaacctggg gggtgcaggc 2460 
cacgtgcaga gctccatgag gtcccggagc 2520 
tccctctgaa agcaccgagg ggttgatggc 2580 
caatcgcttt cctggaggag cgccaacctc 2640 
tctgctgctt gtgcgcatgc gtgcggccgc 2700 
gcgtgcgcgg ggcacaggga gggacagcgg 2760 
agatgggcga cggaggcaag gacaagacga 2820 
ccaaaaa 2857 



<400> 46 

cggcacgagg cacattcctg gggagtcctc agctgccagc atctgattag aaccatatct 60 
ctcgccggga gtggccgcgc ggctccgaag ctcccggccg gcggctattt aagcgaggcc 120 
cgccgcatcc gctgcgctgt agcctggagg ctccgggcgc ggggaagtca tgctcgcttc 180 
acggaggcaa tagctagccg gtgtctgtgg gaggttatgt ttatttgaga gttctccatc 240 
gggatcgcct ggtgtcacca agtgtccact ggtactgagg tttgctgcct gccttcttgc 300 
catgtctaac gaagtagaaa caagtgcaac caatggtcag cccgaccaac aggccgcacc 350 
aaaagcaccc tcaaagaagg aaaaaaagaa aggccctgaa aagacagatg aatatctctt 420 
agcaaggttc aaaggcgatg gtgtaaaata taaggccaag ctgattggca ttgatgatgt 480 
gccagatgca agaggggata aaatgagcca agactctatg atgaaactaa agggaatggc 540 
ggcagctggt cggtctcagg gacaacacaa acaaaggatc tgggtcaaca tttccctttc 600 
tgggataaaa ataattgatg agaaaactgg ggtaatagag catgaacatc cagtaaataa 660 
gatttctttc attgcccgtg atgtgacaga caaccgggca tttggttacg tgtgtggagg 720 
agaaggccag catcagtttt ttgccataaa aaccgggcaa caggctgaac cattagttgt 780 
tgatcttaaa gacctttttc aagttatcta taatgtaaag aaaaaggaag aagaaaagaa 840 
aaagatagag gaagccagca aagcagttga gaatgggagt gaggccctaa tgattctaga 900 
tgaccaaact aacaaactga aatcgggtgt tgaccagatg gatttgtttg gggacatgtc 960 
tacacctcct gacctaaata gtccaacatc ttcagccaat gacttgcttg catcagacat 1020 
ctttgctcct cccgtctcag aaccttcagg ccaggcgtca cccacaggac aacctacagc 1080 
cctgcagccc aaccctctgg atctcttcaa aacaagtgct cctgccccag tggggcccct 1140 
ggtgggtcta ggtggtgtaa ctgtcacact ccctcaggca ggaccatgga acacagcatc 1200 
tttggtcttc aatcagtccc cttcaatggc tccgggagcc atgatgggtg gtcaaccttc 1260 
aggttttagt cagcccgtca tttttggtac aagtccagct gtttcaggtt ggaaccagcc 1320 
ttcacccttt gcagcctcaa ctccccctcc agtgcctgtt gtctggggcc cttctgcatc 1380 
tgtggcaccc aatgcttggt caacaacaag ccctttgggg aatccttttc agagcaatat 1440 
ttttccagct cctgctgtgt ccactcagcc cccatccatg cactcctctc tcctggtcac 1500 
tcctcctcag ccacctccca gagctggccc tcccaaggac atctccagtg atgccttcac 1560 
tgccttagac ccacttgggg ataaagagat caaggatgtg aaagaaatgt ttaaggattt 1620 
ccaactgcgg cagccacctg ctgtgcccgc gcggaaggga gagcagactt cttctgggac 1680 
tttgagtgcc tttgccagtt atttcaacag caaggttggc attcctcagg agaatgcaga 1740 
ccatgatgac tttgatgcta atcaactatt gaacaagatc aatgaaccac caaagccagc 1800 
tcccagacaa gtttccctgc cagttaccaa atctactgac aatgcatttg agaacccttt 1860 
ctttaaagat tcttttggtt catcacaagc ctctgtggct tcttctcaac ctgtatcttc 1920 
tgagatgtat agggatccat ttggaaatcc ttttgcctaa attctgaact tggtctgcag 1980 
accatccaga ggaataaaaa ggttggcctt agtagtcaaa aacaaagctg atagccagac 2040 
acgttctgat ttctgccctt gttccagctt tgacgtatta tctgttgcct tatttctcat 2100 
tgcctcttct acttgtaaaa tgcttttcac tttctgtcta ggttaaagct aaactgaatc 2160 
tatggcttta aataaattaa gatcctaaac tctctagctt aagtgtaaat gaagtacagt 2220 
agtttcccta ctgaaccctg cctcttgtgt ccctggaacc ttctagaaca cctgccttct 2280 
accctctggt tgggagatgc agccaccaca tcccttcata tcatactgtt ttgaataaat 2340 
tttcaaatcc ttattgttaa aaaaaaaaaa aaaaa 2375 
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<210> 47 
<211> 486 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: 1938744CB1 

<400> 47 

gaagaggcag atggtaggaa 
gatgggatac ctgggaggga 
agggtcttag caggtaacct 
ggagtgcctc tctctaggac 
ccaggattat gtctgctaac 
ctgagaagct cctgaggaag 
caatgtacag cgattacgtc 
ataggagccg gggctgtcca 
gttttg 

<210> 48 
<211> 1527 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: 5557436CB1 

<400> 48 

ggcgctgcgg gacggctagc ggccctgcgt gggagttcct gacagctgga ttctagaagt 60 
agaactatga gctcaccttt ggcctccctt agcaagaccc gaaaagtgcc cctgccctcg 120 
gagcctatga atcctgggag gcgaggaatc cgcatctatg gagatgtccc tgcatcccat 180 
gactcggagc tgatggcctt catgacgagg aagttgtggg acctggagca gcaggtgaag 240 
gcccagactg atgagatact gtccaaggat cagaagatag cggccctaga ggacctggtg 300 
cagaccctcc ggccacaccc agccgaggca accctgcagc ggcaggagga actggagacg 360 
atgtgtgtgc agctgcagcg gcaggtcagg gagatggagc ggttcctcag tgactatggc 420 
ctgcagtggg tgggcgagcc catggaccag gaggactcag agagcaagac agtctcagag 480 
catggcgaga gggactggat gacagccaag aagttctgga agccagggga ctcattggcg 540 
ccccctgagg tggactttga caggctgctg gccagcctgc aggatcttag tgagctggtg 600 
gtagagggtg acacccaagt gacaccagtg cccggcgggg cacggctgcg taccctcgag 660 
cccatcccgc tgaagctcta ccggaatggc atcatgatgt tcgacgggcc cttccagccc 720 
ttctacgatc cctccacaca gcgctgcctc cgagacatat tggatggctt ctttccctca 780 
gagctccagc gactgtaccc caatggggtc ccctttaagg tgagtgactt gcgcaatcag 840 
gtctacctcrg aggatggact ggaccccttc ccaggcgagg gccgtgtggt gggcaggcag 900 
cggatgcaca aggccttgga cagggtggag gagcacccag gctccaggat gactgctgag 960 
aaatttctga acaggctccc caagtttgtg atccggcaag gcgaggtgat tgacatccgg 1020 
ggccccatca gggacacctt gcagaactgc tgcccattgc ctgcccggat ccaggagatt 1080 
gtggtggaga cgcccacctt ggccgctgag cgagagagga gccaggagtc acccaacaca 1140 
ccggcacccc cgctctccat gctgcgcatc aagtctgaga atggggaaca ggccttccta 1200 
ctgatgatgc agcctgacaa caccattggg gacgtgcgag ctctgctagc gcaggccagg 1260 
gtcatggatg cctctgcctt tgagatcttc agcacattcc cgcccaccct ctaccaggac 1320 
gatacactca cgctgcaggc tgcaggcctt gtgcccaaag cagcactgct gctgcgggca 1380 
cgccgagccc cgaagtccag cctgaaattc agtcctggtc cctgtcccgg tcccggtccc 1440 
ggccccagtc ccggtcccgg tcccggctcc agtccctgtc ccggacccag tcccagcccc 1500 
caataaagca cccgccccct ctaaaaa 1527 

<210> 49 

<211> 1422 

<212> DNA 

<213> Homo sapiens 

<220> 



atggagacgg agaaatggca 
catcttttca agtagaggct 
tcctttcctc tccagactga 
Srgggctgcag cataggagtc 
agacgctggt gggtaccacc 
actcgggaat ctccactggt 
aagaggatgg cacaggatgc 
actcccctaa ctcaatccct 



ctaatggccc agccacctga 60 
gtgttggggc atggtgagag 120 
ggaatcagag ttctgattgt 180 
tcagctgctt acatccaggt 240 
tgacgacgaa gactgtgtgt 300 
gcctataggt gctgtgtaca 3 60 
tggagagaag taggactcct 420 
ggtacattcc taataaagca 480 

486 
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<221> misc_feature 

<223> Incyte ID No: 7506178CB1 

<400> 49 



??cgaScta ItallaTrtt f^T^^t^ ttcctctctc gacacttcgg taattcctct 60 
cStcctt ttttrnnlnl f^^^f ^ tgtgtcaagc gtatgccccg ggattctcct 120 
ccgcttcctt ttctcggtct tccttcttgc tttagggacc ggaagagtcc ttaaaccaaa lan 
atagctcggc gggcacttcc ggggccggcg cccagagttc cgggagSg caggcaggS; 2^? 
agggaaaggc agcagcggcg gcagctggag gatgaagaag glgcatgjgc ttcIcScS 200 
gttctccgcg tggtacccgt tcttccgagg cgttaccatc aagagtgt?a tlTttcallt leo 
tTtTaTacl iTollToil atttactcga tgatggaact ctgg^g^tt cagg^aggS 20° 
cgatccacca acacattctc agccagacag tgatgatgaa gcagaagaaa tacaataatc 480 

ccaggaS? ^tcaa^r"" ^^^^^^^^^ accagaattt cctgag?ttg ccacSSgt 540 
ccaggaagct atcaattccc tcgggggcag tgtctttcct aagcttaatt ggagtgcccc 600 
ctllTaTtt ttL'?^"'^? caatgaatag ttctctgaaa tgtaaaaccc Sgacat slo 
ctttctgctt ttcaagagtt ccgatttcat cactcgtgac ttcactcaac cat^^a^^r.^ iln 

tSttSS TclaTa'T -T.^'^^^' agaatatgag ctcgttctcc gSa^tg^tg 
gScgSata iTclataTr.^ agtttcgatg ttttgtcaag gaaaacaagc ttattg£?g? 840 
gctcgatata tacagagaca gtagggggaa ggtgtggctc attgacttta atccatttaa 90n 
cggcgatttt Staaaatf ggaagaactg ataLtgaJ TallTtlll 9 

^^'^^f^^ttg acgctcaaga gcaggattcc ccagctttcc gttgcacaaa 1020 
cagtgaagtg acagtccagc ccagccccta tttgagttac cgcctaccca laalltttlt Tofin 
agacctctct actggggagg acgctcacaa gctaaLgac t??ct?aagc tSagagaS llTo 
gctcSS ctaftcir' ^-^=^*^-tg taactggaga agaggaggcc ccgcccSc LJo 
gctccgggag ctgctcatca gccgcaactt cctgccgacc ctgatgcggg tggaccaaac 1?fin 
gacttSct: cttotf"" "tatattca tgtacattca cc?gggg2a SacggS ^320° 
StcSataa llllltaltt iTt^^^^t ^^^agatctt aaacatagga aaaccaSa?? 1380 
gcccngacaa taaaatgctt tctatgaaat aaaaaaaaaa aa 1432 



<210> 50 

<211> 1822 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> inisc_f eature 

<223> Incyte ID No: 7506235CB1 

<400> 50 



cggtggcgga ggcggtgagt gcgcggctcc ggggctggcc gactccgcta ataacccaar fin 
t?aa^f^^^ ^"^^^^^-^^^ cgggctctgg gctctgggtg cgcggaSg gcclggctgc 120 
^t?a^^?^^^ cgcgacctgt gtcagcagag ccgccctgca ccaccatgSg catca?ct?c 180 
tttaagtttg atcctcgccc tgtttccaaa aacgcgtaca ggctcatctt aacaacca^n tin 

Sto!gtg aT.Zrclt ^^^^^^^^ ttagc^gact ?^tgggggaa ?Scgag 3^ 
tr.t^^t^^ ggefeggacat ggaggaaggc aaggaaggag gcacatggot gggcatcagc 360 
acacgtggca agctggcagc actcaccaac tacctgcagc cgcagctgga cSgcaggcc 42S 
cgagggcgag gcacctacgg gctgagcaac gcgctgctgg agactcccfg gJggaagctg Ho 
tgctttggga agcagctctt cctggaggct gtggaacgga gccaggcgct gcccaaaaa? tlo 
gtgctcatcg ccagcctcct ggatgtgctc aacaatgaag ag^cgcagc^ gccagaccca loo 
?gcgtgSS gccS?I cagcccatgc t^gLa^ta ScgSSS 66 

ggccacSSa cc?tcfS ^^^^ ^ ^^^^^<=^^ta tcatcctggt agatgcggac 720 
aaaacc?a?^ ^^^f gcgtagcatg atggacaagg acctctccca ctgggagacc 780 

agaacctatg agttcacact gcagagctaa ccccacctct gggcctggcc agtaacjctcc 840 
tggggggccc tgccttgagg ggcactgtgg acaggaaacc t£cctt?gcc atac^catt 900 
lltrlTrT ^^^^"ggcc agcatccccc ggatcagggc cctgtgg^tt gcgtg?tacc 96^ 
ttlttfJ ^^^f ^-^^^^ gttcagggtc tgcctttatg ccagt^gga gcIgcagaSt 1020 
aScttacac atSf ^^^^ ggccgaggta taccatgaac atgtgga?ac acc^gajccc losS 
actcttgcac atgtacacag gcactcacat ggcacacaca tacactcctg cgtgtgcaca 1140 
agcacacaca tgcaagccat atacatggac accgacacag gcacatgtac gScacaaat 12q2 
gtgctacaca tgtgcacaca tgcacagttg cacagacaca cacacacagg ?gcacacaS ^26? 
cgatgccgaa caaggcagaa gggcgactct cacctctcat gtgcttctgg ccagtaggtc lllO 
tttgttctgg tccaacgaca ggagtaggct tgtatttaaa agcggcccc? cctctcc?at ^3^ 
ggccacagaa cacaggcgtg cttggactct tgacaagcag accgctcct gcSggaS llTo 
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ctcactgcaa ccEccgcctc ccaS^c^^ nZ^ff^^^ Stgcagtggc acgatctcgg 1560 
ctgggattac aggcg^gtgc caccEaccc aonf ^^^^ tgccccagcc tcctgattag 1620 
gtttcaccat g£?ggtcagg ctggtcSaa acSf f^^gttttag tagagacggg 1680 
cctcccaaag tgctggga?? ll^al^^l^ actcctgacg tggtgatcca cctgcctcgg 1740 
cataacagac t?tccagcac l^^^^""^^^^ gccaccacgc ctggccgaaa aacccactc? 1800 



<210> 51 

<211> 6104 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 1302184CB1 

<400> 51 



?SSaSg a:cc^£S SggSSt tlTtTr ^^^^^^^^^^ = 
Ctcacctttc ccttttccct Ittrn^nt^ ttgttcttgg aattttgggt gtcctctttt 120 
gattgtcatg gSccaggg a"ccttaaa caao^""'^" gaactccgga agactgtaga 180 
cttcctgccg gtctcctcag ccaagccccc agSctcc? gcgagtcaga 240 

gctctccact ccaaaaccta aa^;,^^^^=^ agctgctcct ggctgtgcag gagaaccttt 300 
ggaggaggat gadget cSgcggga tgSattcc f^f^^^^^ tggagccaga 360 
gaaatgggtg gcaggagatg gt^^ggflga gSggai^tt t^tltT^""^ cggacagtga 420 
cacagaggga agtctgaagc taaaaattca ^t^^^T t-^tatcaagg aggcaaactt 480 
gaatttggag aactaLtat gtccacctga aat™^^ cgggctaaga aacccccaaa 540 
ccagaaggtg tcccgtgctg gaaSSaSg caaaacca^a f ^^^^^^^^ agtctgggga 600 
ctccaaaaag aagctcctca cagccaoSa orhf^^f ^ aaggaggaag aaagaagcca 660 
gccacaggct ta^gagaggc llllglHZ ttcStc cat^^tL'ca ^^^^^^'^'^ ^^0 
acaggacttc accggtgaca ccttaaaarn cattatgaca cgggcctccc 780 

ccacatggac tggtccacca actcta^nf^ aaagcaccag caaaaaagca gcagccagaa 840 
tccagaStct ggSagaaa atlrl^^T^ cggacccgtc actcagaatt gcttcatcag 900 
ttcc?t?gca Sggcccagg gtaagalaaa cTtT'"'"'' cccgctcttg agcccgtggc 960 
taacaataac aaagatctgc lltlaaatT, ^^f^SSr aacacgtgga gtcagttgtc 1020 
accagcccca cccagcagct ctgc?Sa?a cT.rT ".t ^ccccaagca gccacagctc 1080 
tggaggaggt acaaagSgc cccc^aaacc Zntl^'^^'l^ cagcccttgg tggatcaaga 1140 
taaaaaagat gtgataaSc lalrt.t^^^ acctacggtg ggcagcaaga aaaagtccag 1200 
ccagaaagca ?t?gacaata claaaa™ aaacccagac ctggattggg tcaagaatgc 1260 
agaggcaLa ccagccaggc agaacgSaa ttc?S^^ tattccgcag atagtgccca 1320 
tcatgtccgg attfctatcc cSS^gc accctctSa aaS^.'f ^'^^-'^^^--^ 1^80 
gaaaaaaaga cagtccatta aagcggtgjt ggaaaaaatc ^tl^Z ^""^ accataagag 1440 
ttctggaatc accatgagca gtaaaata«^ 5 ^"^Sf^^^gaga aagccttggc 1500 

taagaaggat ccccgfgtcS ctaaStafa ctttccaact ctgaggggaa 1560 

tggccttgaa actggtggaa atgcSSa latttt^ Sragaatgagt ccccctcagt 1620 
gaagccacco atggtca£ga cllatrf^^tn ^^^^^^^ ggaggtgtgt ctaagccgcg 1680 
gccagaaatc cagca?ccfa aatttactac f ^^^^^^^'^ cactctccat ccagaaagcE 1740 
acctagcacc atgc?tcgag aggcaStat tggacttgca gcaaaccaaa 1800 

cccgtctgca tatccca?c! cc?ca?ccag ccctctctac ac^^^^*'^^ tgctggagcc 1860 
tgtgatcact ccagtcaaaa aaaao™^ =^^1 accaacacag acagtcttac 1920 

cgagacgatt catgagggaa Stccaccaa aagcagcctt tgctcacagt 1980 

tggcactaag aaaagaaagc gacaa™« ^^^^srtcagt cccatcagcc gagagtttcc 2040 
agaggacaaa ccca?gagcg agaJgSSt tca^a^^f"^ ttggcccagc tagtgccggg 2100 
ggataagaag accatcafa! cStcaat^J alta^t^ gttggaaagc tcggcgtgtt 2160 
gaatcagatc ttgtcctgtt ccaacaacot ctcaagagga aaaacatctt 2220 

cagcGctggg gcagcagcca ttafan^.^^ tgctctgaag gcaaaagctc ccccagagac 2280 
gggaaccatc ?aS?tggca lallacaaaa ^^'^^^-^^^^ cagattaatg tcagcaagag 2340 
atccgaagaa cccaaalS ccatcaaaca ono^^^^'^'f agagcagagc tgccaccccc 2400 
tccagccgtg ccttccaaS tt^agt^Ict tgtgScatct ^J^^^^^'^'^ agccggatgt 2460 
actttcaaca cagttaggtg ggtccaataa rf^n^^^ tcaccagcag ctatgcaccc 2520 
tttttcagag ttgaaaacta lalaltttnf ''^^^^^^^^'^ cctgccagca ctgaaaccaa 2580 

ccaaaaggga atlcacagtg glacctSaa SctatcSa "^'^^^^^"'^ caaccaaaac 2540 
y u gaacccggaa gctgtctcca cccagactga tggccaactc 2700 
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cccttcacac ctgtgcgaga ttggctccct aaaggaaatc acgctgtccc ctataaaraa ?7fin 
ftctaa^^*" ^"^^"^""^^ tccccagcga cagcggcatt gg^aclgaca acaacagcac 2820 
ttctgaccaa gcggagaaga gctcagaatc ccgaaggagg tactctEttg at??ctgctc 2880 
cctggacaac ccggaggcca ttccgtccga caccagcaca aagaaccggc atggccaccS llll 
gcaaaagcat ctcattgtgg acaactttct ggcccacgaa agcctcaaga agccaaagca lolo 
caagaggaaa cggaaaagcc tgcaaaaccg cgatgacctc cagtttctgg caSacctaaa llll 
ggagctaatc accaagttcc aagtgttcag aatcLccac cggagttac? ccScSSa llll 
cgagaatcca tatcccagca tttttcggat taattttgat cactattacc cggtgccaS 3^80 
tatccagtat gacccgttgc tctatcttcg taggacttca gacttgaagt cSSaaia 32!^ 
gcgtggtagg cctgcaaaaa ccaatgacac catgacaaag gtgccSS^ tJcSaaaa?t Wll 

llllXZ llllTtltl tgcLccta? ^gSctt' ScSt 

gcccatgatg aaccttggtt attacggtca gtacccagct cctttgtacc tatcar^r^n -^Aon 

TolTaTtl llllTallaa T'^^T"" aacagtgLa ccacc^cS tTol'aTclll 
cncccacgta aagatgtccg gtgcagctaa gcataaagcc aagcatgoaa tacaccfrrr-?, tr^h 
gggacctgtt agcatgggcc ttggtgacat gcagcctEct ctgaaccct? ccaaSataaa 36^ 
cagtgccagt ctgtccagtg gtcggctcca taagaggaaa cacaaacaca agca?aaacf llll 
caaggaagac cggatcctag ggacccatga caacctgagt ggtctttttS caagcaaaac lltl 
Ictaafallt '^^^^^^'^^^^ tcctgagcga gcggctgajt agcgcagaca aagagc'cc 3780 
gctggtgagt gagaagaaca agcataagga gaaacagaag caccagcaca gcgaaaccaa 3840 

cgcSSat SS^a^ ^^'^'^^^^^ ..acaccct^ tctac^ SSccg llti 
cgcccagcat tggacacagg ccaaggaaaa aggagacttg agcagtgagc ctcrtaaactr iQfin 
atgcacgaaa agatactctg gcagtggcgg ggatggtggc agca^gagat cgSgaacct 4^2? 
tZlTaalaa ^^Tt"^^^ acccttcgaa tgacaagtgg glcag^gacg tgSggSg iolo 
taaaaggagg agctatgaag gctttggaac gtacagggaa aaggacatcc aacrcctteas li In 
gatgaaccgc aaggagagaa gttcttatga ctcctccatg tcfccaggga tgccaagtcc tloo 
Stgaccaga aaaaaacn^" -^<=agtgca tagtaagaac gaaggc^cag tgccSacc^t 426? 
ttofnt^^f^ aagaagccag ccgcagttga cagtgttaca attccaccag ccccagtgtt 43^0 
atctctcctt gctgcatctg cagcaacgtc ggatgcagtc ggctcctccc taalatlalc, All 
caSaScS ItlloT"^ aagccatcca gtgcgaagtg Sgaagatgt IZlTtllll tlTo 
caagatcctg tccaccaaga agaacctgga ccacgtgaac aagatcctga aaaccsaarcr l^nn 
gctgcagaga caatcaaaaa caggcaacaa cttcgtgaag aagaggcgcg ggcatcccaa till 
gaagcagccc acccagttcg atgaggactc cagagaccaa atgccSgc ggaaaaatg 4620 
iTrTr. '^'^^^^'^aaaa gaggccagaa gcccagcctg agcccgctgg SSggagcc ttlo 
^^^^^f caagacacca tcatggccac catcgaggcg gtcatlcaca tggcccgggS till 
gcc?cc?cca ccccctcSc "^^^^'^^^-'^ gcccctgccg ccaccgccgc caccacc^ct 48oS 
ggcccccact atZtnnnt^ ccaagacccc ccgaggcgga aagaggaaac acaaaccgca 4860 
ggcccccgct cagcccccac agcagtcgcc cccgcagcag ccccttcccc aaaa^^rr^^^r,^ laon 
cStctaaa c^tc^"^" ^aagtcccg agggag^gaj agcgaggtcc ttloolSg till 
^^^^^^^^^^ cgtctgcacc tggggcctag ggaactgaca cgtgggaagc gcagtgagcc 5040 
^^^^^^-^^^^ cgctgcaggg acacccacgc cct?c?ctL agaagccg^g liol 
caggcagaat ccggccagac gacggggctg agccatcagg agctcttggg aaaglaaaac ?i fin 
agggagacac cttcagaaga agcttgtctg agcttcaccg cagctcccac cacgcaacac llll 
lltTnt^'^f ctggatcctc cgcaggcgag cggaaggccc ccaggaggag caggcSg^g 528? 
tZS^lc S^:^":^ tgccaaggtg cgccctcgac tcccga??tc a?Sggc 5340 
gaaagggaaa eaccctcaat taggaaacat tccaagggga gaaaagctgc 5400 
aaattgctca actggcattg ctgtaacccg ttccatttaa ttggttttta tttcSttaa llll 
ttttcaagct ctgctaagct ttggggtacc aacgtcatct tcaggtgacc tgaatc^ttc Ittl 
ccttaaccgt acagtttctc gatggaattg tgtgatcaga aggtggaatt cSgtaataa llll 
gcgacctcag acccgcattc atgttctgtg tgcctcttct a??gcSata cJctgattS llll 
tagcattgtc tattcctatt tttcctttgc ccattgtact tccatatJtc ttttfattaa llol 
cttacttgct gccttttttt tttcttggta cacatttaaa taaagtaatc cttaacctat 576^ 
gctgtaaagt tcacccttgg catgctgttc caagaacctg ggtt?gaatc ccaatcgtta 582^ 

S'accca ScaS "^Ttlll "^^^^^^ a?gcagagca gc^Sta llll 
caaaftaaa^ f^^ft^ t ^^ST'^tatttt tatacttaaa acatgtcagc agagcatagg 5940 
Saacccca allllTrttl ^^^"^^^^^rc aaatagagta actgacaaac cLcaaaaac 6000 
SStcaaJt aattShn^^f ^^^^^caagt gtctagcaat gccttggtac ctgatctttt 6060 
ccattcaaat aattgtcata ggttgttcaa aaaaaaaaaa aaaa 5104 

<210> 52 

<211> 1137 

<212> DMA 

<213> Homo sapiens 
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<220> 

<221> inisc_f eature 

<223> Incyte ID No: 7506232CB1 

<400> 52 
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ggagtgggag cgagcggcgg ggccagctgc gttctgagcc tgggcgcagc tgccatctac fin 
tctgggaagc accagggtgt ccccgccgcc ctcagctcga agtcagccac cltaalaarn iSn 

aS^cSac? ?ctSS? TclTar' "^^^^^^^^ ^'^^^-^^ £lTtZl 
nflr.^l^t^ tctctagcat gctctctgag gaggaaaagg aagagttaaa agcagaatta 240 

iiiiiiliiipiiii 

iiiiiiiis^iiiii 

SStS tSStaSa f -^^"^ aaatgg^ccc tLaSL'gt tSatS ? S 

tgatggagca cttggacatc acattttcag tttcacaaat Qnn 
?S2t5 "gta^^^^? ttc^^^f f acaggtccac aacgggtcti attcSJcg 9 2 
t|cttatga^ ^Sgc^ c^^ggS aS^Sc ^^^tLI^^^ ^^Sfa 
gatggggcct tccgacggtt tccgcaatag atggttcctt aaattcatcc accctgt 



1020 

1080 
1137 



<210> 53 

<211> 1011 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> inisc_feature 

<223> Incyte ID No: 2585358CB1 

<400> 53 



fogllToTt iToTtrT '^Srcgcctctg ggataggcga ccacggtgtc ttcaaaagcc 60 
SaatcaaaS ^^^^^^^^^g gggccggacc gactgtgggt cagtttgcac cagcgctctg 120 
gaatcgagtt acgcgcgaaa gggcagagtt tctggaggaa accgcaacct rtLL^^^? ton 

iiE iSE iHEE iiSi = Hi 
=s sssi ~i H i ii fiS £S 

= iSS EiS 1= 

sss" ?s?rT isiss 5fs 

2egct??3^ ScSS ?ccoact^;2 .cc.gottto cct„tg,tg Igg.L.?^' ^sS 

ctt?,=.tS ?c??=SS^ ^SSS SSSSf 

gctactttcc tgggagggcc tcatgtttta tctggttctt aaatathfah l.r. 
aataaaactg cgctactaaa aaaaaaaaaa aaacctc^^g cSttSg ^'^'^^'^^^^ JJJ, 

<210> 54 
<211> 4638 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> inisc_feature 

<223> Incyte ID No: 3961495CB1 

<400> 54 
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gaccctttga aactgtcgag tgacctggga tcctgttaaa cactgacctg ctttataaaa 60 

ii =1 FFF — — ssl s 
i ?• ."SS£ =^ 

yuyaacuca gcattaagtg ctaaacaaaa tggatccagc agtgctatac aasarrrr;3=,rrrr ^cn 

aagtggagga aatgcttggg attcaggacc tcctgctggt cctggaatac tcg^cSaaa 42^ 

aaggggcagt ggcaacaatg gcgttggtaa tatccattca ggagcttggg aSacccca^ llo 

caSScatc 'T''''^^'^ tgaatgggga atggggaaag cccccaaacc Sca^tccal 540 

ill iSE SSs E=i 5S 2S£ = 
iS = iSS iSS iSs Sir? =s = 

sss =ss shr SIS = 
Sp EiS =3 SS Eri 11 

ill gllli iss =i isf 

tacaaactgg ggggagactt ?aaaacctaa It^n^^''^''^ aatacaaagg ccaatccagg 1440 
agacaacaa? gtgagSact ggggaggagc tgctt^tgt? ^a^caaaSa 1=°° 

igfgllsssHSEi ifi 
?=iii=iH!iiiit2HHSii.i 

gaagccaccc agcagtggca gcgggtgggg T^lllTo't gcaggSS cggtggS?? llTo 
tggaagagct ggcgcacctg ttgctgcctc agccctgtgc aaaccfgc?t cSaafcta^ ?1 
gcaagaaggc tggggcagtg gtggggatga aatgaaccL agtacSgcc agtaaaaaaa 
cIgSgaa? TcTcf.lll llT.lTtl''' akccSct S??S?c 22 

t£S tgg I^c^Sgc: :::a^S SSggS ?e^--S: 

i i S iiii ~ ~ SaS^ sSi iH 

~ ii E — -™ -aS 

gattgcctcc gggctggS tgc^aaaett cccagtcagg ccctgggtgg 2640 

sss s i «™ =1= ^?.°s 

tc.g?t,a.g ct,g=«2c Z^tT.^ ^IJcSgSl c^St^tfJ •■=™3<="" ^'^ 

iiilis^L^'ipLllli 

gcagtottc. ocMacacot ItgcS'ctta c???ctcSr r,^^^? " 3210 

~s ' 33%°0° 

cctaal^n^^ cagtggacgc accccaactc catggataac ttgcccagtg ccgcttcccc 3420 

mmmmmm 

cactgggcct accatcaaca ccaccatcca ggatgtcaac cgctacctcc ?caagStgg SYsS 
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c?ctctarS Itt^^^Tt ^^tcgacgtg gtcctctggc cctacctccc acacgcaagc 3 840 
tccSStta ^^ca^tccca ITcTl^ ^^^^^^^^^ actgcaccca cgaggc^^cc 3900 
cage? ta^ llT.T.T.l SSSS ^acc^S ^cag^aS L^^SS ^^.^^ 
gctcgttctt cgaaacctca ctccccagat tgatggttct acac^gcgga catto^atS ISfiS 
gcaacatggg cctcttatca cattccacct gaatcJgact caaggcaS c?gtgScca tlTo 

-sss ssss ?=£i 

ii iSF °— ~s ssss ~ 

cccgtgcctg ggtggcaagg ggagcagtga gctgctgtgg ggcggggtac cccaatac^r^ Jlln 
cagcagcctg tggggcccgc ccagcgccga cgacagcaS Kga^fggca gccccacacr tttn 
gctaaccacc ctgctgcctg gggacctgct cagcg^ggag ?ccctSgg ctctgccSc tsGO 

ssr ^^^^^^^^^^ -^'^^--^ --•^---^ -^s 



4538 



<210> 55 

<211> 2233 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7500801CB1 

<400> 55 



tlfZ':^^^^^ tgccgggtgt ccagcgaacc cctttcccaa accttcgggg agaaggaaaa 60 

ss= ^~ ~ ? 1= 0 
=s sLi:— £i E 

aaatacagag acagaaagac agggagaaag agaaacagaa aattagacac clllalc-^tt lln 

SHE ~ ~ ~ =si i i 

~ liiE iira iiis Eii iSii Hi 

ccggacctgc ccgtcagctt ctctccgacc ctgctgEcac ?c?cactcg? caggacaaaa 7?0 
Icct^^^^'^^ tgaaggccgg cagcttcctg agaattccgt ctctgcacS gc?cctc??c 7S0 
?acctattca tcaiaaacL' f "^^^^^^ gatgcatttg cgggcctgtc ccacctgcag 840 
caccccctca tcgaggacaa tgagattggc tccatctcta agaataccct caaafrrrLh^ onn 
cgctcgctta cacacctaag cctggccaat aaccatctgg agaccctccc caaa??cSa lie 
tgcScc SSSccI S^t^r^"" ^acctccgc'^ gSaacSgtt ccIgtgS? 10 

£tF ---s :s::Sa ]^ 

SSS ?c?c--- SSS ^^ca^S^- - i 

ill iii !S= iSS SEl ssi 

^aar^^^^f^ agaccctggc cccgcggcgg ctgctgcggc ccaatgacgc cgagctccta 1500 

||H SiS SlSl SIS isS iiSi i: 

cccgaScg agga^tct" S^caf'^^ ^^tggccgct tcgagagacg cacgSS iS'L' 
tgcctcacac aofsn^l^n^ tgccacacgc cacttccagg ctggtgggga cgtgttcctg 1800 
tgcctcacac gctacattgg ggactccatg gtcatgcgct gggacggctc catatttc^crh iHfin 
ctgctgcagc aacttccctc gcgcggtgcc cacggtccca gcctacSa cJtcccSc 
lallTataai iTalaT^''" -atctggac atctcggctg gggcagaS^ cccatcSfg 1980 
g^SSSe SStS acSS a\~^ SS^S ,^^2 
to^^^a.'^"'' ^^srttgatgg cattcctttc ccatLtaaa c?ccacgSg ccaatcgc?? 2^2 
tcctggagga gcgccaacct cgaagacaac tcagcactgg gcgggtLcc ctctgcgc^ 222? 
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tgtgcgcatg cgt 

<210> 56 
<211> 4884 
<212> DMA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7506414CB1 

<400> 56 



PCT/US02/39133 

2233 



cactgctcgc ?acaaat^S^ '^'^^^"^"^'^ aaccggcacg tggagtcggt ggataagcgg 60 
cactgctcgc tgcaggttgt gccagaggag atttaccgct acagtcgtag cctcaaoa^a i7n 
ctgctcctcg acgccaacca gctgcgcgag ctgcccaagc cttStKcg gcStaaac 180 
ttgagaaagt tgggcctcag tgacaacgag atccagcggt tgcctcccga gaSaccaac lln 
ttcatgcagc tggtggagct ggacgtgtcc cggaacgaL tccct^agat cccggaSaac 300 

ccSg^t ItlTclaT. '^^'^'^^^^^ gacttcagcg ggaaccc^ct ctcSS^c ITo 
cctgatggct tcactcagct gcgcagcctg gctcacctgg ccctgaatga tatatctata A^n 
caggcactgc ccggggacgt gggcaacctc gccaacctgg tgaccctgga gctccgggag 48^ 
rtt^l^''^''^ agtccctgcc agcgtccctg tcatttctgg tcaagctgga acagc?gga? tlo 
ctgggaggca acgatctgga agtgctgcca gacactctgg gggctctgcc caatcScaa Itn 
gagctgtggc ttgaccggaa ccagctgtca gcactgcccc SgagctcgJ gaacctacaa fiSS 
cgcctggtgt gcctggacgt gtcggaaaac cggctggagg agctgcctac lalatlfa^n 7on 
gggctggtgc tgctcactga cctgctgctg tcccagaac? tgctgcggg gctgcccSc 780 
ggcatcggtc agctgaagca gctatccatc ctaaaggtag accagaatc? Sctatacaa^ lln 
gtgactgagg ccatcgggga ctgtgagaac ctctctgagc tgaSctcac Jgagalc?ta 900 
ctgatggccc tgccccgctc cctgggaaag ctgactLIc tSaccaacct caacataaac 960 
tlT.T"''^ tcgaggcgct gccgcccgag atcgggggct g?gtggcact cagcgtcctc 1020 
tccttgaggg acaaccgcct ggccgtcctg ccaccagagc tggcccacac gacaaaacta inaS 
cacgtgctgg acgtggcggg gaaccgcctg cagagtctjc cg^tcg^gct cacccacctc llTo 
^t^T",^^^ ccctgtggct ggcagagaac caggcgcagc ccatgctccg gttccagacS 12o2 
ccacKa^n? cccggaccgg cgagaaggtg ctcacctgct acttgctgcc cc^cagccc 126? 
c^ar^S=r ^^^f^^^t^^ tgggcagcag gggagcctct cggagacctg gagcgSgcc 1320 
TJaatt^nt'' gcgtcagcgt catccagttc ctggaggcco ccatiggtgl tgSgacgct 1380 
ctcfaaatS ''^^^^ gcggggccta cagcgccggg ccacacctca ccSagcglg llTo 

ctcaaggtga tgaagaggag catcgagggg cggcggagcg aggcctgccc ttgccagcca 15^0 
llfnT^^^^ cgcccttgcc tgcagaggag gagaagcggc tgagtg?cga gtctggcctg 1560 
agtgaagact ctcgcccatc tgccagcaca gtctctgagg c£glglccga gggcccgtca JfifS 

SSS'? SScS S= a%~ ~ 

~Si f=~ 

ss^ss^ -ss cds £sS 

KSSS t.gtcctc„ .ac,.,»?g° 2 « 

casjgccgaag aggaagaggc cagcactgag gaggaggaca aggaggocac cataai- i-t-nt- on nn 
gcgccctctg tcaagggagt gtcgtttgac caSccaata afftSt Sag^ctgct 2^6^ 
cgcattgagg aggaagagct gaccctcacc atcctgcggc agac£ggggg cotgggcatc 222^ 
agcattgcgg gcggcaaggg ctccacaccc tataaggggg acgacglggg catat?catc 228^ 
ctggSS atgggSg^ ccctgcggcc cgggctggag tc^gtgtg^ Scaagctc 23' S 

aiii i is I E ~ =s 

iii ~- ~- 
s i ™" -~ ~- 2^^^s 

ggtgctgctc accgcgcggg cacactgcag gttggcgacc gcgtcctctc taaaator^^^ 9-7fin 
cScgt?S SS^r^' tctggggctt agt^^tjtcg ^aSctccga c^^^tcSc llTo 
^^^^^r ^ gtgtccagga gcctggtgtg ttcatctcca aggtgctccc gcggggccta 2880 
gccgctcgca gcggcctgcg ggttggggac cgcatcctgg cagtgaacgg gcaaaacata Itln 
cgggatgcca cgcaccaaga agcagtcagt gccctgctcc ggccSgcc? ggagctSca 30oS 
ctgctggtgc ggagggaccc ggcacccccg ggcctacggg aLtgtgcat ccagaaggca 3060 
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cctggggaga ggctgggcat cagcatccgc gggggtgcca aaaaan^.nr.r^ ^ 
cgcgacccca cagacgaggg catcttcarr- tnr-t^lZ^ ggggccacgc tggcaacccc 3120 
cgcgacggtc ggctgcg^g? gggtttgcga ctatlaall^ gccccacggg ggcagccggg 3180 
ggcctgacgc acggcgaggc gSgcagc^a ctSara^^^ tgaaccagca gagcctgctg 3240 
ctggtctgtg acggcttcga aaccaar^rn ^^""^^^^^tg tgggcgacac cctcaccgtg 3300 
attgccaacc ccStgcggS aScggc Sc™^^ tggaggtgtc cccaggtgtc 3360 
atcgaccggg agctgagccc toa™n?« ''^'"^^^^^''^ gcctggagag catctcttcc 3420 
ctgcactggg g^ccSggc cKSgaaacc ^.T.TJ^^^^ aggagctgcc tggacagacc 3480 
gactaccgcg ccctgjcc?c cgtgc^cagc aataaolaf^ gtctgcagcc cctgaagctg 3540 
gcagctggag ggaaJLggc tga^tctccc ?gc?ccSta gSccaaS ^^^O 
ccgccttctc cggatgagct gcccgccaat gtaaaaS^ gtggccagca gccgccctcc 3650 
gtgcccactt ctcacccgcc tgaggatgcc ccttaa^t^n ^^'^^'^^^SST^ cttcgcggcc 3720 
gcctccccgg agcagctgtc cttccaaaa^ cctgcccagc cccccacgcc tgggcctgca 3780 
gtgccccagg ccgagggL'c ccctSgcgc StcSSg ^^aggtgcgc 3840 

aagatgcagg aggaggaagc caaaaaaoh^ r.^- tgggtgctga cgacctgcgg 3900 

gcggcagagg ctggggccga aSagactc aa^.^t^^^^^ gagcgcagat gctgcgggag 3960 
gaacaggagg atgagcagcc JcccSccc larTn^^^"^ gggagacgct gggcgaggag 4020 
gcgtcccccc cgcccctggg Sgtggcgcc ccaat™ ^"^'^^t'^^^^ gcagagcccg 4080 
caccaggagc ggctgcgcat aallll^n^^ ccggtgcgga cggccaaagc tgaacggcgc 4140 
cctgccgagc Scgggccct Sgagaccaaa f^^^'^f^'^^^ cacccgagcg tgccctgtcc 4200 
aagtcattgg aacgScgc ?ctccgagca caaaS^^^' tgtggagggc agccaggatg 4260 
cggggcacgc gggggcccct ggagcgactg gccgSaccc ° ccaggaaggc 4320 

tcgcccaccc ctgtggaaga cctcgacccc c^^tnr. ^ttcccctgc gcccaccccg 4380 
ccggactttg ctgaggag?t SS^^cctg Saaccae?^^ cctccccggg acgcctgtca 4440 
gaggatggag aagtggcLt ggtgcttctg gScaaacc^^ ^f^^'^^'^^^^ cccgcaggag 4500 
gaagatgtgg cactgtgcag cagccaccac '^^^<'3gcgc: tgtgggccct 4560 

cctgtgccct cctajajgag caggcacc^c o^™^?^ ctgggcgccg tggcctgggc 4620 
tccagcacca ccct'gcccc aagcttJta acctaaatS ^^^^T^^^^^^ ccctgccagc 4680 
acaggagttc tggcctgtga ctaactaaS tagcatttta aagagacccc 4740 ' 

tgtaactagt gatgtt?g?a caaccaaS ??cta?tttS ITolltir ^^^^^^^^^^ ^800 
tgactacatt ttaaaaaaaa aagg ccctattttg tggtttaagg agaataaagt 4860 



<210> 57 

<211> 5079 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> inisc_feature 

<223> Incyte ID No: 7506415CB1 

<400> 57 



iT^llTo ?gcaggtS gSSggfg T.ttir^f '''^''"^^^^ -^'^-^-^^ «0 
ctgctcctcg acgccaao?a gctgcgSaJ cta^c^^^ acagtcgtag cctcgaggag 120 
ttgagaaagt tgggcctcag tgacafcaa? ttfn ^ cttttttccg gctgctgaac 180 
ttcatgcagc tggSggagc? ScSta^cc tgcctcccga ggtggccaac 240 

atcaagttct gcLggc^ct ggagatcS glcttcaao. ^^'^'^^^^'^ cccggagagc 300 
cctgatggct tcactcagct gcgcagccta ggaaccccct ctccaggctc 360 

caggcactgc ccggggacgt ggcaaccS gccaacctaa taa^^T^^ tgtgtctctg 420 
aacctgctca agtccctgcc IJcgtccSg ?cat?tctaa Sf'^''^^^^ gctccgggag 480 
ctgggaggca acgatctgga aStgctgccf aBrlntnt^^ tcaagctgga acagctggat 540 
gagctgtggc ttgaccggaa ccagc^atca ao^.f gggctctgcc caatcttcgg 500 

cgcctggtgt gcctgglcg? gtcgSaac caacS^^^ cggagctcgg gaacctgcgg 660 
gggctggtgc tgctcactga ?ctg??gctg tcccSac? toc^^"'"^" tgagctcggc 720 
ggcatcggtc agctgaagca gctatccat? ctaa^^^^f tgctgcggag gctgcccgac 780 
gtgactgagg ccatcgggga ctgtgagaac ctcfcf^!^ accagaatcg gctgtgcgag 840 
ctgatggccc tgccccgSc cc?Sgaaag c^gactfaa^ tgatcctcac ggagaacctg 900 
cggaaccacc tcgaggcgct gccgcccaaa aS^^ tgaccaacct caacgtggac 960 

tccttgaggg acaaccgcct ggcStccta ^^^^^^^^^^ gtgtggcact cagcgtcctc 1020 

cSg?s; i ™S S~ 

sss?^ ."ss? .~c? ;l"o° 
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PCT/US02/39133 

tziiiiti s??Sc's sss sssss si?s«s r-"'" 
ssis ~ "« 

gactctgggt cgcccttgcc tgcagaggaa aaallat-n^f. f^^'^f ^rccc ttgccagcca 1500 
agtgaagact ctcgcccatc tgccagcacJ Itatata^^l tgagtgccga gtctggcctg 1560 
gctgaggcac agggtgggag ccaacaaaas ^ ?^ ctgagcccga gggcccgtcg 1620 

gaagaggact accIgggS cacggScS ttaartT^ ctggcgggga ggaagacgcc 1680 
gacagggaga tcgagjajgg gcaSc^SSa alrnnnf^^^ acgcactgct gcccggggat 1740 
cggctcatcc gcaagga??? aScacSc Itlllaatrt cgggaggcag 1800 

cagcccgagg ccgttgtggc tctgcSaa tcaagatctc caagctgccc 1860 

^ctcccgggg gc^ggLcaa tggcccSc faZccTaaa ^1^^^^^^ ^^^ccctgtg 1920 
gaggaggagg aggaagaggg tagtcctcaa alant^^^^^ ctcctcgggc ccagaaggag 1980 
agggctgaag aggaagajgc cagcactgag SaaaaS^? aagaggagga ggaggaaaac 2040 
gcgccctctg tcaagctgac cc^caccatc olllaant^t ^Sr^agggggc cgtggtttct 2100 
attgcgggcg gcaagggctc cacaScct^t lloaaaa^f.^ ctgggggcct gggcatcagc 2160 
cgggtgtccg aggaaggccc tgcggcccgg SStcc otTr^^T'' ^^^'^^^tctct 2220 
gaggtgaatg gtgtggctct gcagggcgcc aaSrSn^ Stgtgggtga caagctcctg 2280 
ggggccggca ctgccgtgca gatgSagtg llacaaat^^ aggccgtgga ggcgctccgg 2340 
gcggtcacca tcacgccjct ScggcccSa aaSf^f S^atggtgga gcctgagaac 2400 
ggggggctgc gcctgcccct gc??ccgcS iTlolrtZ ^"""''''^^^^ gcggcgggga 2460 
gtggcctgcc tggcacgcag cgagagSg SScttca aTJt "."^ tcagcgccac 2520 
tccacaccct acagggctgg tgatacaaar sh??? 5 f gcattgctgg tgggaaaggc 2580 
gctgctcacc gcgcgggcac actgSSt ggSccaca tc'^f .'^^ -^agggcggt 2640 
gacgtgactg aggccaggca tgaccag" Sc?ccctac taac^Jo^^'' taatggagtg 2700 
atcgccctgc tgttggagcg ggaggctaaa aao^nh^^? ^^^^ccgctgc ctcccccacc 2760 
tcctcacccc ccaccgc?gc ?S?gccacc accaS^h^ ctcccagccc tctgccacat 2820 
cctgggttgc cgagcctggc ccccagcctg ctaacta^r^ ccactgccac ccccggggtg 2880 
gtggaggaga tccgtctgcc aagagctgaj aalcal^Z^ cgttggaagg gccataccca 2940 
tccgaccatt ccagccaccc gStgStSc claa^lnnf^ g^cttagtat tgtcggaggc 3 000 
ctcccgcggg gcctggccgc ^cgcagcggc cScaaa^S ^*^^'^Sr"^^t ctccaaggtg 3060 
aacgggcaag acgtgcggga tgccacacac rt.^t^ ^ gggaccgcat cctggcagtg 3120 
tgcctggagc tgtcgcgct ggtgSSgg gacccaacao '^"^^^^^^'^ gctccggccc 3180 
tgcatccaga aggcacctgg SagaggSg ?acatcaa^^ ccccgggcct acgggaactg 3240 
cacgctggca acccccgcga ccccacfSc gagggca?ct tTJT^^^ tgccaggggc 3300 
acgggggcag ccgggcgcga caatccranr^ tcatctccaa ggtgagcccc 3360 

cagcagagcc tgcgggcct gfcgcfcggc TaacaT^ tgcggctgtt ggaggtgaac 3420 
gacaccctca ccgtgctggt ctafatn^^^ gaggcggtgc agctgctccg cagtgtgggc 3480 
gtgtccccag gtgtcat?gc c2ccc?tt acTclT.Z ^^^^^^^^ agccctgSg 3540 
gagagcatct cttccatcga ccggqaacta ^arn^t^^ tcggccaccg gaacagcctg 3600 
ctgcctggac agaccctgca ctgggSccc ^alaaat^^n ^"^'^'^^^Sfcaa ggagaaggag 3660 
cagcccctga agctggacta ccScglccta aagccgcagg tcggggtctg 3720 

• agggtaccgt ctggagcagc tggagggaaa Itl^n^^^^ ccagcgctgg cagcgtgcag 3 780 
cagcagccgc cctccccgcc t^ctccSat aaactS. °*^<=^^'^5<=tc ccctagtggc 3840 
agggccttcg cggccgtgcc cac^tcSc tcartf^ ^ ccaatgtgaa gcaggcctac 3900 
acgcctgggc ctgcagcltc cccggaScaa ctnttft^^ atgcecctgc ccagcccccc 3960 
gagctggagg tgcgcgtgcc ccagjcfgag alttaoT. ^^^^^^^^^^ gaagtacttt 4020 
gctgacgacc tgcggaagat gcaJgaSag Haaaatalt ^^7=^'^^'=^ cctggtgggt 4080 
cagatgctgc gggaggcggc agaggc^ggg accaaaa^S= gaagagagcg 4140 

acgctgggcg aggaggaaca ggaggatga? caaaa^fnnt ^^^'^^^^^'^'^ ggacggggag 4200 
tcaaggcaga gcccggcgtc ccccccgccc ctaoZa^tl ^^^^-^^^^^^ gagccccacc 4260 
aaagctgaac ggcgccacca ggagcggctg SStoo?^! ffcgccccggt gcggacggcc 4320 
gagcgtgccc tgtcccctgc cgagcSgg acccl^atn^ Srtccggagcc accggcaccc 43 80 
agggcagcca ggatgaagL a?tggaacaS gacacS^ ccgagaagcg tgcgctgtgg 4440 
aggtcccagg aaggccgggg cacgcggggj SSSacc aa.^"^^"' ggtcctcagc 4500 
cctgcgccca ccccgtcgcc cacccctata att^t^nt^ gactggccga ggccccttcc 4560 
ccgggacgcc tgtcaccgga ctt?gctSg Sa^JaaS f'^r""^^^ cagcacctcc 4620 
cctggcccgc aggaggagga tggaSgt? actllalfnn ^^^^^^^^^-^ atctcccagc 4580 
ggcgctgtgg gccctgaaga tgtgacacta ?Tr.t^^^ ttctgggcag gccctcaccc 4740 
cgccgtggcc tgggccctS ScSccS aggSaaa^ ^^^-^^ aaggcctggg 4800 

gggggccctg ccagctccag caccaccctt gccccaaa^. f^^^'^'^'^^^^ gacttggggt 4860 
ttttaaagag accccacagg agttctggcc lalaltt^l ttttaacctg ggtgttagca 4920 
agacctcggc gagactgtaa c^aStgltat l^fttt ^aactgcccc accccagccg 4980 

cnagtgatgt ttgtacaacc aaagactcta ttttgtggtt 5040 
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wo 03/050253 



taaggagaat aaagttgact acattttaaa aaaaaaagg 



<210> 58 

<211> 6065 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 72192179CB1 

<400> 58 



PCT/US02/39133 

5079 



???5aa\%^ ttTaTttii' ^"T-^^ ^0 

tgtagagcaa gacacatcca atgtSaaa lltatlt^t^ ccaatgtggc agcttttgcc 120 
atttaaggca accaacaaaa agcg^Sc tgSttSr ^aaaaaa gtcacaatac 180 
aatgaaaaca aatacttcca caggc?taaa l^tn^tlt ^'^^^^^^9^^ aggactttct 240 
caacttaacc ctgagaataa ta?™^f aatggcaatg gaaaggtcct tgaatccaat 3 00 
cctcctccac caccSc^aa tgcJSct Tltf^^'t -ccttccacc tccatctcca 360 
cctttgatga tgtttcctga SaaSggg tt?cttccct cacta^"''^ tcctccacct 420 
aaggctgaat ttgaaagttt tccaggcctc cctctto^^^ cactgtccac agagaagata 480 
tctgaaagag aaggttcatc gatg??tctS ccacc^o^^o ^f'^^^^'^^^t agatgagaaa 540 
aagccagcac atctcctttc ctcctctact atantl^ ctcctccaac tccatctcaa 600 
caatattccc aaaaagaagc c^cgaaSct caaaattctn f ^^J^^^^ra cttcatgcaa 660 
aaaaccggtg tgttgccacc tcccacattg codlla.l^ aggctaaaat cataacagga 720 
gataataaga acgatttttc ccccaaaat? at^tt aacttcccaa gcatataaaa 780 

tgtaaaatta ctacctcaaa ggatcagaaa Iaaa^?^^^^ cctccctgtc agatatggaa 840 
acggagacaa agcagaacgt ^fttagS aatctt^«?? ^^^^^^'^^^^ cagtgaacac 900 
gactctgcaa actgtctctc acacSaott aaagaaaaca attatctatt 960 

attgcgcctc ttataaaatc tcattcStt caltlaTatl ^^^^^^^^^^ gaaaaaacag 1020 
ccttatabga gaaacattta agacaccttt Itatalta^^ ^aggacaaca aaatccaaaa 1080 
aaaaagaaga aatgaaaaac agaaacaggj ;aat?c^^^^ tgaagaaaaa tatagacaac 1140 
agccaaaatc aacacataac agaggtggaa aagSaatar T.t^''^^^^ taaaactcaa 1200 
ggaggtttcc ctatctggaa ttgattcaaa a^^^^^ . "acaaaa. aaaccaatga 1260 
tcaaagtaat gctcggaLc taJgagtS ttltaaS.^ gttcaaccca gcccaggctc 1320 
agaaacagtc gctgccaaga ggc?ccacca ta^tttlnn ^^f^^^^^^cca caacatcgcc 1380 
gatgaaaaag gaagttttac alagctcaaa aSca^^«^^ ^cttcagaag acaaagataa 1440 
aattaaacaa agtcaccaag aatgtaSac claaSaac? caa^^^^^*^ cagcttgcga 1500 
gcagttgcac ttgccccaaa gcaaaccaat ttaaaatt^t caacagaaga agtatttgga 15 60 
acttccaact ctagatcata Lttaaatga aacagaccac llTt^^^^ aaaccatcaa 1620 
gcaatctgag attgatgttc aaacctttac c™«^! t^'r^^^^^^^ gtcataaaca 1680 
tgaagcaagc actgaaEgta gtcataSa ato^o^ ^ tatctgaaaa ccaagaaaac 1740 
taagaaggag aaaagag^ga Sgtacaftt gccSS ?ccaf " -tcagttacc 1800 
agataagctc aagatggttc ccaggaagca aaaaaa^?^^ ^'"'^^^''^^^ agaaccagga 1860 
acttccagga agtgaagaaa aaaSSaggg ^catc^at. ^^f^^f '^^^ acagagggaa 1920 
attaataact gaaagaaaac acgaaca??t Saa^ataa! f^^^'^'^^raa aagaagagag 1980 
gcaaaaggtt atcgatgcac atcttoat^r 5 tcagcaccaa aggtcgtcaa 2040 

acagaccgct gaalgtfaag ctg"cataa aaalttS^ aattttcagc aaacacaaat 2100 
ggaagaaaaa tgtctcgaag tcaaaaacat trtt^^ cagccatata atagtctgoa 2150 
agattcaaag caagagftta cacagaaca^ atct?^o^^^ ^^^f ^ttct ctaatactaa 2220 
gcgggatgat ggaaaaggtg cctta^^tat ^gtgga^t^c SSaaS ^f^^^^^^ ^280 
gcaacagatt ttgtcgagag tgaaacaatt toa=„!f„=° ttgagaaaac gtgaagaact 2340 
aacatttcag acactatLa aLctatccc SaSt^ ccaaataaaa gtggccttaa 2400 
atatgcagtt cacattgcca tggagaataa t??«^f= ^ ataagtgaag ataagagaga 2460 
tattaaaact caagcggaag a^agcttgt gtccStg:: llrTT.^ aaataacaca 2520 ■ 
gatgtcctcc aaaacaggaa aaccSggaS Saacccfct aa^^^f agacagccat 2580 
caaagtatct aatgttcatg tcagcaataa taaaaa^^^^ agtcttgatg aaacatcatc 2640 
tgccaaagag aaaacagtac agcfccaaat Jan^^o^^^ gaacagaaag aaaataaaat 2700 
tcacgtgaaa acccatS^gg Sttaaact t?ataa^^ "^^^^^^caa ctgttcgtag 2760 
aaaaacacgc ccaccgtcac caac^tttat cao^If ^ aacattcctc ctccctcttt 2820 
aaaccctact aagaacgagc t^tScaatc c^o^f tctactgccc gacgaacaga 2880 

accaagaagg cccatgLgc aaaaatcSa aattcaca^? gacagttatg ttgaaccccc 2940 
acccaggagt cgctctgaL aacttgtaaa ^r.t^^^ gcaaacactt ccccttctcc 3000 

aacttgtaag actcaaagac accactgcaa agttatccaa 3 060 
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tccacctaca atcacaatac cagtaaatat aaa^^^^^^f ^^f^'^'^^^tg gtagggactc 3180 
atctgtggac gctcaagagg aaftcagga^ agtggaoaaa S^^^^^'^ ccttcagaga 3240 
agatggacta aattccactg atcaca?gg? gccSacacfc ^f^^^^f^" ^tgttcataa 3300 
aatcatccgc aaggttgcag tgcctcc?cg cctSac ™f ^tgcagttga 3360 
ggccaaccga actgttcaaa tggctgaaaa tSataltf gatatgaagc 3 420 

cagatggttc agggaatttg agcatggcJS llTtllfltt ^^^^'^'^^^^^ atgaaataaa 3 480 
ttatgcaaag ggagaaacaa accataaca? acaacaaaa^ gcaaagtcaa atagaagagt 3540 
ggaatttgga ttaacatctt taggaaJcac gaattt^f^^ agtcgtacat tttgtaagga 3 600 
tagagaactg cgagaaaaga ttcctgttaa gcagcccaaa atnf ^<=^aacatcc 3660 

tctaagtgaa catttctcag gcatggatgc IttfaZt^f atctgctctg aaaccaggtc 3720 
gaaaacctct tcatcacata ?ctcaSgc £gSa?S S^f ^^'^^ agtcgaagat 3780 
cccaccaacc tatgaggatg tcattgctgg aStatt^h^ ^gctgtgact tcaagcatgc 3840 
agaagtaaga aaaaattttc aaaagacg?g gca^aaa^^^ Satatctctg attcacctaa 3900 
gggatatgca accgcagatg cttc?gcaac SaataaS^ ^^^^^^^^^tt ttaaaggcct 3960 
tgcatttata agtgaagctg ctgctccaaa accaccttcc aagaggaatc 4020 

cagtttatcc aatggagtgc ctagtgScaa a^^^^^ " ^^gtatactt tgtcaaaaga 4080 
gatgccacca ttgcaacagt aiaSagt? SgaaSta lllT'^T ^^'^^^-'^tcc 4140 
tatactgtaa acctcacttt aaacaacttt ttlllltrlt ''^^^^^^^tt catggacaaa 4200 
ttggacataa gcagcataaa gatagatgj; actgcaaaaa ^^f ^""^^ gatgaaggtt 4260 
ttattcctaa tgaagaacca aatatgtgS aaalta??^^ ccaaagcaga tcagtggact 4320 
gagatcgtaa tgaacattta gatgctggta aca^r^^=^ agaaaacacc cttgtacctg 4380 
aattagggga aaggggaaaa ?ta!aagS ttScctcI f^""^^^^' Sratttgagai 4440 
aaaccttacc ctttgaggaa gagctcaaaa tal^f ttccaaggag atccctaaga 4500 

caaccctgct atcccc?gaa ?t?aaaagtg aatcS^a^^ taagtggcca cctgaaatga 456? 
aaaataaagg acaaagacaa gatcact^tc cattttSn» ^^^f^^^gtt agaactccag 4620 
atgtttgtca gaaagaggat gttatajgaa tSaaalat ^^^f '^^^ cagtccaccc 4680 
aagatgaaaa gaaggaagga aggaagaftg tgcaaaata^ ^aaaatgcct gaaggaagaa 4740 
caaagagtaa caggaaaagt gctatlgatc t^aafca™ ^°*=Sragtgaa gctgaagaca 4800 
gtgctgaaaa ggagaaaaat gaaaaScta ^ccaif^^f °^^5^^*=Stg attgtgcaga 4860 • 
ttactaacac tgatgatgag atga?gccag SaaatcaS "^^^^^^^^^^ gttttacagg 4920 
ataataacaa ttatgtagca gtctcSt? tttttlftt^ agaaaatttg aataagaata 4980 
tagaatttct tgatctatta cccttgtcga atTalar.^^ caggcagaag acatctattt 5040 
aaattgagaa gttagaaaat acatc?Sa S?caaaat^ f f.^^^^^ca aatgaatatg 5100 
aaaagactta ttcgaggaat gtactagcaa acttggtata tttgaatctg 5160 

ctgctggcag tcctgtjcag cc?gc??caa Scaaacct ^f^^^^^^*^ ^acagagcaj 5220 
aggggggaag ttcaatcatc tctcctgata caaatc?ct^ ^^^^ff^^^ cttatggtaa 5280 
caaagagcaa aaatttacac tttttc?ttt ctaaca^^^^ aaacattaaa ggaagccatt 5340 
agaaaaatga gaacattttc aattgtgitt taataaaS f f^^^^^^^ gcattttcca 5400 
tgccatgctt ggatttaagg gaatttgaaa Ja^^^^f^ tgtagatcaa attaaaaata 5460 
cagaagctgc ccgcaataat gaaaacSg gt^tS^^^ accttggcat gttgaaacaa 5520 
ctaagccttt gtttcccaga gtggaggtgc fatca^Snf ^^^^^^^^^^^ gaatgtacag 5580 
ttaaaagaaa caggtgctac iggacacg agtaaS^a? f^^^^^^ gaagagcaga 5640 
cacttaggca ctgagagata ttgatgttc? gaalJaaaa^ tgacagtcca 5700 

tttgaggaac ttgatgtaaa catggSgttc fa^ah^?^ tttatgaatt tggataccct 5760 
ttcttgtatt acaccttgtc attStSca Satt^^^^? ^^^^^^^^^^^c aatgggatat 5820 
tggaatgaag agatgaaaca ctatgSat gtt"ccat^ ^r^'^'" "gtttgaac 5880 
tgttctgttt tcctgtaaaa catcligggt SgatttSa ^^^^^ tttagcatat 5940 

attattataa cttctctgta atttccS?g aaS^aa^^ '^^^^^'^^Srct gcttgtcaca 5000 
caaaa aaataaaatt gaatcacctg aggtgcaaac 6060 



<210> 59 

<211> 1055 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> inisc_feature 

<223> Incyte ID No: 7505908CB1 

<400> 59 



c^=c.=„, ^^^^^^^^^ ^^^^^^^ ^^^^^^^^^^ 
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Sagttcct gSStgaS tttcgaccgg acSaS? tTT^^'^ aagttcggcc 120 
attacaagaa tgatgtcatg atcaSs^a^^ 'icgggaagtt aagatatgcc aacaacaaca isn 
acatttcttt tacaacatcf S?ggt? StSt'Sa fT^^^^^^ agagatS? 240 
cagaaggctt acgagtattt tattatlSg tcca^L^^f tgtcaatcaa tccaaggatc 300 
ttattggatt acacttcaag attaaa™ ^^^^99acct gaagtgtttg gtcttcagtc 360 
Sgtgggtggg agtagaaaa^ tttgtgtata talT^^^t ^^attggtgt ggacatgjg 42^ 
aS"^'""' taataaatat gtgatLS? ccSSSaa f """-'^ ^aactataaa 480 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa a™ aaaaaaaaaa aaaaaaaaaa 540 

aataaaaagg gaggcacaag agttgSaS JtaJaaac^" aaaaaaaaaa aaaaaaaaaa 600 
JiL^f Srsaacaaggt tcccLga? ttSSaaa? aTf^.T^^ ecgggatttt 660 
aaaaaaaagt gcccagggga attttttaao gagatttggt aaaacctggg 720 

aaaaaagagg ggtaaaacaa ggggtgacct afa^f ^^^^ ccgcaaaaaa ttcgggccgS 780 
aaagattggg gtgcggttta SJcggggat aacataaaf "^^^^^^^aa taaa£tcgca 840 
tgagcaagat aacggggaaa aacaggStt aSaaaS^ gacaatttgt ctaaaaaaga 900 
gtagcctttt tttatgcgga aaaaggggqt aIcro^^^^ ggccacacgg ggggaaaggg 960 
taaaaggtgt ccaaaggggt ttccccgga^ tgccf Sra^^ttcccc agagggtct? l02o 

<210> 60 -"-^^^ 

<211> 1651 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> miac_feature 

<223> Incyte ID No: 6590147CB1 

<220> 

<221> unsure 

<222> (1, ... (1651, 

<^^J> a, t, c, g, or other 

<400> 60 



TcTaSil ^ ^tggtaacgg 60 

nnnnnnnaag gtagSaS cafa^^^^? ctctgccnnn nnnnnnnaS 120 
atcatgaagt ttocagggcc tttggaaaac nt^^ f^^"" agcaggcaga tccaaaggat 180 
atcactaggg aagcacljat gtgSaaata ^^f^"^^ ctttcctgtt ggaaaa?gca 24? 
T'r^T ^-t^5-9ttg ?a?tgSt? T.ltltTtt aaatcagS bSo 

gaggaagatg agagaattcc agtactaaaa tcctagctgc caagactgtt 360 

tcctcatctg aaattttgag aatgg^aaaa ^.It .^^""^^ Sragacagttt ctg£gga?gt 42^ 
cacacagcca caccatt|gl ttSct?ca^ ataagttgaa ttggJ^St 480 

cctcagttac ttttcag??t gcccSattg llaJtt^l^ ccattgcagt gtcaactagg 54^ 
aagcaactac ttcactgtat ggcctg™ attn^ff" aacatttggc agtcctta^c 600 
gctctggcca tggttagtct JgaaSggaS aaao^^^^^ ^^"cagagg atccatgctt eeS 
attgaactge ttcagaaa^c SStgga? aar^^^"f ctgatt^ct ttctct?aci 720 
gtggcacatc acctttctac tctgcaatn^ t^^^''''^^^ tgatccattg tcgggagct? 780 
cgtcccctca agcacaccct ggtgaccS afr«=''°''"' "^^^^^tccgt ttagtctac 840 
tctgtcccag gcccagactt SccaagSc Scaacf tgttcagatt acatccctcc 9oS 
acagcagcct tttaccatca tctoccaach ^^^^^caagc cagaagtgcc agtcagaggt 960 
cgcaaagtag aggaaatgga igtggaSac ttatf.^T ^^-^5<=^gac ctctactSa 1020 
gaagataatg tctcagaaaa tgt??JtSt llalnr^^^^ gaatcaaacg gctctataat 1080 
ggacatgctt ccccttgtcc acc^tScaa cc^af^^^^^ fgatttatc aagacaagag 1140 
ctacctttga gtgtaaacta aggtjgactf ctS^ tcatgtagtt tcaacaagt? 1200 

aaggctgtag aaggaaatat aStaacaa acS^ff^'' gagaacatgc aaaatcagga l260 
cSaaaaL'a ^''^^'^^'^ tggeSS ?aScagS tSSf '^'^ aaaaaaafa'a IIS 
tgtJata?^? aaaaagaaat ccaaaagatc caaactt??? S^^ff^^ acataaagac 1380 
f^^*" cagtggccct ttttttccgg ccata^^^^= taaaacttaa aaaatctctt 1440 
aaa^r ^"t^^t^gc agaattctg? ccccctS^ ^^f ^^^tgg ccccgtcaaa 1500 

^SSr^"" attaaagtac cgactttggc ggggccatta ^^^^^^^^^^ tccggggggc 1560 
tctgtgaccc gtcgtcctcg tgaggggocg gtgccgcgcg agtgcacttt 1620 

<210> 61 ■'-^^^ 



81/89 



wo 03/050253 

<211> 2015 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 6828539CB1 

<400> 61 



PCT/US02/39133 



^^:PI a— S S-S- ----c a..a.cacc. SO 

?ScSS tSSS^? SSSe^S l~ =S lis 

tcatggatag ccagggccga TatgSTa SStact ^agttggccg 240 

atgtcacgct gctcgacccg gcctccaaaa a^n^^ ^^'^^'^^^^^g tctcccgacc 300 
tcgtgcgctg tgacgcggtg atgccaccca all^^l^^^ gtcgtaccca ctgggcgcca 3 60 
gccaggaacc cgagcgcjcg cagcccgac? f ^^"^^^g ctgctcgtgt 420 

cggagctgat ccgagaggac atccaggggg c?ctaca^«! ccagggcctg cgcctcgggg 480 
agcgcagggc ggcggcgctc agggocfcgc laalalt^tt ttaccgctcg ggccgcgggg 540 
ccgctgagac cccgcccctg cagcgccgco catcffh?^ Scagcgcgac cgctcgcccg 500 
agcggggcgc gggccgcgga cgfccccagg cSaSca? "^^^^'^^ agcaccgtag 660 
agaggcctga gccggtgggg acctcgagS SSgactc aarT^^"^ ^aggaggcgc 720 
cccggggtcc tgacctggcg gttctacaaa ggcctccccg gacctgggtc 780 

tgttcgacga cgtagagagc StgScaf agtggacatc ctgaaccacg 840 

tgctggagca ccggfaacgc ggccacaaaa ^o.^*"^^^^ gtcggcggag gcggccaggg 90o 
tgacgctgcg ggccLgccg ccctSSg ccgSa? Sfgcggctggg gagggct?g 96? 
agtacgcctt cagcctgctg gcccggctg? SSocaao^^ ^^^^^^^^^^ cagaagatca 1020 
agctgttgca cttccttttc gggcc?ct?c S?aa??^^ cgccgacccc tcctctccgg 1080 
agttcgcgag cagtgtgcgg cggccgcatc lalUlalnt f ^^^^S^^cg ggggggccgg 1140 
acaacgtcac tccacgtgaa aacgagctc? SccSn^ ""^""^^^^^^ ctgctgcggg 1200 
ccgggctgga gctgtccccg gaggagggac cccSaScaa f ^^^^^^'^^ tggacccgcc 1260 
gggagccgcc ggtcactgac ccScaSgcc gcacctaa.? ^^^^^^^"c ttcagcggct 1320 
tacagcacga gcggaggcgc cgJcaggSa cSaaca?^ Sgacccagtt gagaaacagc 1380 
aagtccgggg gcgcggccgg tccgcSgJc cccjcctaar ^^^^^^^^^^ cgaggctggg 1440 
agcaaagcgc cccccaggtc actotra^^^ cccgcctgac ccgactgtct tacttcctac 1500 
ctcagctgga gtcagaSca gcagSSa? ggtccgS taa'??T^^ ^-tctgagc llTo 
gcaacagcag tgagctgtcg gtcaagcajc ??Scatarr ^^^"f Srac ttccaggccc 1620, 
agaaagtgtc agagctggag gcagtgatjg aafaaSaa^ ggaggtcctg gaggacaaag 1680 
tggaaatgga ggtcatttga cctgccaggc gccc?tca^^ g^agaaggtg gaaggcgagg 1740 
ggagaacgga ctcctcagac tctcccrn?^ srcccttcgca aagagtgacg aggccccgtg 1800 
atctgctccg gcctggtctt ^cc^c^tccc SacfSa ^^^'^"-^^^ aggatggcca I860 
tccctccgga gaggatctgg actggctggg agt™S an^t^''^^^'' cttgctgcag 1920 
aagcgcctag cagacctacg agcgjggcag SaS ST^S^ggagac agtctacgga 1980 



<210> 62 

<211> 1760- 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> inisc_feature 

<223> Incyte ID No: 7170321CB1 

<400> 62 



StSa ToTcltT^ gSaS llilT'^^^ ----^-^^ -tggactaca 60 
cggccagtct cttccgc?gg cggcatSgg cccaaataaa ^""^^^^^^^ aacatcgaca 12 0 
aggagaagga ggaactggac aggggctS gcgaSScaa f ^'^^"^^^^ cagttccaga 180 
agaggaaact gaaggagctg gajgtggccg l^aacaa^^t ^^^^^^Srgtg gccgagtgcc 240 
aggccgaggc acagcagctg cgSaggagg agcggagcta aaa^^^^"'^ gagcgcctgc 300 
tgcgcaagaa ggagaagagc atgccctgga Icataa.^l. ggagcagaag ctggaggaga 350 
gcaaggccca tgcccacgct cccgccacc? taata?^^ ^ gctcagcaaa gacggcttca 420 
cgtggatggg atgggcgctg cgaccSgc ccgcccgScc oTaT^'"'^ ccacctccgc 480 

use ccgcccggcc gcgctcgagg cgctccgcag 540 



82/89 



wo 03/050253 

PCT/US02/39133 

ccttgcccca gcccactccc cchnhr.=,r.^^ 

agaagacgga ggaggactca gaggaggtga aaSaac^f^ catggtaaat accaagcccg 600 
aatacgagaa acagatcaag ?actt?ggca tgcScac^^ acacaagacc ttcgtggaaa 660 
acctgtcaga caacgtccac ctggtg?Jca aaain^^^ =tgggatgac agccaaaagt 720 
gcattgacct agaggtggag gaSatgtg cacStaaa ^r"^^^'^^ ^tcatttgjt 780 
tcgtcatgca atttatcctg gagctggcca a^^^^^^^^ Scaggtggcc caccagacaa 840 
tccggcagtt cttcactaag !t?aagacag SaatcS^f gsrtggacccc cgggcctgct 900 
acgagctgga agccttcaag gagcgtgtac aaSn^^ gtacatggag ggcttcaacg 960 
ccatgaagga gtacgaggag gaggJgcS aaafa™^ ^^^^^^gcgc atcgagaagg 1020 
ccgtcgaggt ctacgagtcc ctcccjggg ScaaL ^ggccccggc ggcctggacc 1080 
tgcagatgct gcaggacgcc atcagcaaS taalnZ^ ^tgcttcgat gtgaaggacg 1140 
agcgctgcat tgactctggc ctctgggtcc ccaactctan "^"^^^^^^^ taccacatgc 1200 
gagaggaggc aggtcctggg gacccattar 5 ggccagcgag gccaaggagg 1260 

aggatgtcag tgtgtgacc? gcccScta S ^^^^^'^ tcccaagacg ggcgatgaga 1320 
gccccttttc agaaaacaga Satgccat ctaa2^""^ ctgcttccag gcccctftgt 138? 
ctgctcggcc cagcctgggg ggcccgccc^ gccctcccta f^""" tctacttg?g 1440 
ctccagcgcc cattcaagtc tctacttto« ^ gcctctccac tgtctccact 1500 

atcagcatta tgccaaaggc ccggggg^cc SagSaS^ "cactgcct gcagcccccc 1560 
accaggtagt tggggagggt ccc^ajccaa gaac™?^ ^Sraggtcacc aggctggtct 1620 
tcactgttcg tctgctgtct gtgtcttcta ttlaan^^ ctcgtcactg ggctctgttt 1680 
agatttcaga tgctcaaaaa tttggcaaac agcaatgatc ttccaataaa 1740 

<210> 63 
<211> 2599 



<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7505918CB1 

<400> 63 



s~ ---- — - 

ga^cSaa^^? --S-^ t^^S^g H'o 

cgcacgtccc aggagacgS agSgtSaJ ^^^^'^'^^-^ taccctaacf 240 

gacatatccc acatctccca gigattggaj aafctatat^ tcctcgggtc tcggggactt 300 
cttgagcctg tgaaggacac tgacattcaa a^ntt^ f ^ cagccaccac ctttgagcct 3 60 
ctgctgtctg ccatcgaaga gtcccaaaa^ ggcttcctga agaatgagaa ggacaatgcc 420 
cgggagtcaa tgttgjttga gtgggagcaf ataaS'''^ ^catggctga gjagtaccat tlo 
gcatcaggag aagacgccct tgfc?t?act cafo^? gaattctgca cacactgctg 540 
litTJT ^^^^^-^--^ ctctctggat aacScSS Tar'tT^ catcag?ga? 600 
tatatctata atgagaaaat tgtaaatgga caccS^ *=fSr=c'^atsr«= gcggcaaatt 660 
gcttccgtcg cagagctgga tgataagagc atfcf^n^^ ctaacctggt ggacctttgt 720 
atgacagaag tgttgttgac accggcfacg gataccct^^ tgtggaccat ggtaaaacia 780 
gtgcgcatgg agtttgtcag gcaggcott? atatt^^ff agaaccgcag cagcgtggaa 840 
acccttgtga ctgtctttgg aaa^Sgc^t caScccaan f^"^^^^"^ taagaattac 900 
taccaattgg ttcgaagttt cctgaacatt aaf tgggcggggt gcctgggact 960 
gatggagagg tggaaggcca tcctgtgtgJ acacS^^^^ ctcccttgcc tggactacag 1020 
gacctgcttg ccgcttcaca ggtaatSf? actactgcat gcgctgtgga 1080 

acctggttcc aggagtacat gfacfg^aa^ afcaa^f'^^ accagctggg agagtStSa llTo 
aagctccggc tgcattaccg Lgggccctc aaaaan^^f tgtccccagc tacggaaaac 1200 
gtgtactgta tcattggcag atggacgtc acSacaaS fSratcccta caagcgggcc 1260 
?c™''^" acctgtggct ga^gttgSc cStgtgtt ^^^^^^^ ggcggacaaa 1320 
tccccacaag acaggctcac tctctcacaa t^o™? ttgacgacga tggcaccagc 13 80 
gagtcccact ttacggtgaa ccagcaaccc Setoff ^ agttgttgga agactatggc 1440 
gcgcagtttg aagcagcagt tgcct^St ^^on^ tccaagtcct gttcctgaca 1500 

gtccatgtag cactggtgct gttSSJ aSa^f? f^^^^^^gcg ctgcca^gct I56O 
gctcagctcc tcagccacga gcctgSS cc?cc?tac^ ^r"^'^"'" tggacagagt 1620 
cggctcctca tgctgtacac ccggaagttt alcrtn^^^^ tgcggcggct gaacttcgtg 1680 
tacttctatt tcctcaggga tggaJSt aScaagafa ^^^^^f^^^ ggccctccaj 1740 
gtgagtgagc ttgtgattga aagccgajag tt^SS a^\~ 1^0 



83/89 



wo 03/050253 „^^„ „ 

PCT/IIS02/39133 

llaiZTtl cttSSc - gtgacacaaa gcctattatc 1920 

gaccttgcca agaatgc^ga caagSc^a ^agaggcagc aaagctgtat 1980 

gtcccccaga tcagtgcccc gcaatccaar Jf^'^^^^*^^^ acaaactgct gagccctgtc 2040 

attgccgaac ggtftSggc tcT.T.Ttl ^ggcSa^a SJSt^S ^f^^^^^^^ ^^^^ 
tatcttcttt tggacttaat rar-r-h^t-t-^^ agcgcaaata aatttgtgga ctccacgttc 2160 
gcttttgata tcStgSg ctSagctS ?tScc2L' tattgataga 2220 

agagtggctg ccttcagaaa Itlclatall alltttntfn ^"^"^^^^^^^^ tgtggaagag 2280 
attacctttg ctggaatgat accJtaccS gaagtcaagc ccgcactctg 2340 

cagatggagg tccJcatgai SSgcca ?a?ttS?^^ acaccaatgc gaggctggtg 2400 
tcagtacatc agacacataa nnnn^r-f tgctttgtgg gagtctgggt cggcacactg 2460 
tttgttt^gg ttlctatlll Itt^ttltff ^I^^^^^^^^tc tggttttgtt tctgttgtg? 2520 
tttcaaaaS aaaaaaaaa gtcaacgcca ataaatttct ttgatttgL 258? 



<210> 64 
<211> 3305 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7505935CB1 

<400> 64 



cgggacagag gcagggttct gagggcaggg attccccctc ^rt-n^^rT^^„„ 

s?s s= S2?s?? ~s i SF =s= 
2= ?S H r 

ctcggtccat ttcagtagaa gggagctccc tcttaat^^ gaactgcagc 3 00 

cagatcactt acaaagatgt gStltgaat attcacSc Sttttc?^^ ^^^'^'^-^^gg 360 
gtttggcaaa agaaaaggta tttactitgc aggatctatt acaactca^J ^^^^f^^^"^^ ^^0 
ctacttccag tctctacaaa tcactggttt caaaatcKa t.^^ . aaaatcaacc 480 
ttcttatgca ttttttaaaa gaattggcag aa?atcaSa ^^^^^^^^^^ caaaaaggtt 540 
tggaaactca gacaagttcg acatttfaca ! agctaaagag agttgtaata 600 

ttgatgatca gtttgcaaa? aatlltn^^^ gagattctct ggctgagaag cttcagctta 660 
agctaLtga gtSaglga jSatagaaS aacSJ^" ^"^^^^^^ct ttagaaataa 720 
tgaagttttt taaagatacc Saataar^? agcaacttcg ggcagaaatg tgtcaaaagt 780 
aaaaggagtt aaccftg?tc caaaataa^^ ggaagcaaaa aaaaagtatg 840 

atctcgt?ct tcgggaSag ag^accctS aaaaaat^n^ "^^^^^^^^ aaatctgaag 900 
caaaagaaat ttatgctcaa aaacafc^h? f^^S^^^tca caagcaccaa gagattgaaa 960 
gagaagcaga gctgaagcaa agagttgaag ctt^taaa?^ "^^^^^ratttg ctaagaggaa 1020 
aaaaacataa aagcataact aa™of^= <=ttttgaatt gaaccagaag ctccaggaag 1080 
aggagaccta tgaccgaaag ctcSgaata aac?tc?f ^ ^cagaatata aagagttttg 1140 
atgactacat cittagaact aatcglcJal ttall^lt gtatcaactt gaactgaagg 1200 
ctgT:t:cattt gcaagaggag ctca?aa^?^ aaggaagaat aaagaaaaag 1260 

ctgtaaatcg Egtgaalgaa cttgaactS Ittt. aaaggaggaa ctcaatcaat 1320 

caataacaaa acalaacSt atgctSS aaJS? tgtcaaagcc cagtctttgg 1380 
tactaaaaga agagaaactg aaacfc?r^^o ^^^^^^"^^ agagatgagt gattattcac 1440 
aagagagtig aaa?gaSc SSctcc taa^ccL^t f tacttaaa caacaactgg 1500 
ttgcagtctt tcagaaagaa ctacaaaaaa It^^T^ agctcagccg gctcctgaac 1560 
agttcgaaag ctgcaggcaa gctctgcaci aacaao^^''« tatagtggtt gagcatgagg 1620 
cacagctgaa ggcccagatt tllaaftt^t aacaactgca agacgaaatt gagcattctg 1680 
ttgccga?tt aaaattSa cSagcaaa ctca^af^' aaagagttta actactcagg 1740 
gcaatccaaa gcagtc?gtg atcaa^catt rt^tn t^'' cctagagaat gaagtgtact 1800 
tgccttgcaa ?gg?gagata SSSa" tc?SacS ?cct^^r"' ^^^^^^^'^^ IS^O 
ttctagcacg tatggttgca tcalaa^tr-l f tccttttaaa caggaaaacg 1920 

gttcccctga ttcgacctt Sg???aS t^ T^l'''' aactgcatgg gtggagggta 1980 
agcaagagjc cgaacgct^g jSagct? tcaSSt^ aaagagcttc 2040 

actctgccaa aagcccacta ar^crn^^^Zl ''^^^^^^^ta ccatcggaga gtcattaaaa 2100 
tcaaaaacat taSccagt tccccgSaa llTtlttlt .^^'^^'^^^"^ ctggaagcct 2160 
ctgagcagcc tcaagtgggc ackcttaa^^ ttT t ""^^^^^^gac agagttgtct 2220 
gcagtgcagc ctcglgg^c cgcgaoaacf ttt^.^t^^^ cgtcgtggaa gcactgacag 2280 
H^ggctc cgcgggggca cttcctccag acgcctctct tccacacccc 2340 



84/89 



PCT/US02/39,33 

ac^tSSc TcTclTcTt tgtcctSS "^^f^'^^^^^ Waaggtctg ggcagatcac 2400 
ggcacagcct ctcca?ccc^ cctgtScca a^^.f^^^' accatcaccc actgagtcta 2460 
gaagacaaac tgaacttcaa Sacaaaaata TJtltn^^^ gcagaaagtg ggtctttatc 2520 
aggata'atga ggagtttgaa ^SSStSS aatotata^^ tgtggacaag ctagctttta 2580 
aaaaggaaga agaaaaaata cgggaacag? SSaaaaaa IrltT""^^ attgaagaac 2640 
gaaggcaqaa taacctara^, r^f^^^r^f ^^Stgaaaga acgaaggcag agagaagaaa 2700 

drs .€ i SiF- fir. 
•sss: ~ S SSK .lis 

iif- 

sr"- "« """s? ss= ™- iiti 



3305 



<210> 65 

<211> 2923 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 4225965CB1 

<400> 65 



I™ SSi ill Sis! iiS iSS i: 

gcctagtggS acccaflctg llaZaaan^ cccccagccc ctcagggggg ccagcgaaga 420 
ccccccgggg atcctcaS cltfar^nnn agtggtgctg gctcccctgg 480 

ttcttcc?cc tcctcatccc cctcaa«^^^ cccacagcgt gctgacacgc cgcgaagaaa 540 
ctcctccgcc aacctStaq tacaa^co^^ Scctcggcag aagctctcca ggaaggcaat 600 

sss L^ssi I- 
£ ~° -~ ~s ~I js^s 

~ ii i! =1 iSI ~ 

aatgaagctg gctgagatca agagtLIaa tgactcagS ?Sgccg??g gctfcaaccc llta 

Siil =i £= iiii = 

tggagactca gaggcraaaca t^n^^a^^^^ tgtgttcctt ccggatggag acattctcac 1500 

?ss sEi ~- ""ss 
Scss ii -sss' s??e?s ^52s 
c= sH i~ ssss iz 

r-rrr,r.o=„ Hctgccccggg ggctctgcac acacccctcc cagaaccgct tcctranntrr iQ^n 

cScctcS SSacSgtc iZllllll -tgSca? I'lsS 

aggactgaac acggggSt aa^K2^^?^ ^^tccacccg agtggggcag ttgtggccgt 2040 
gaac acggggaggt ggttggtttt ggacacagag accagagaga tcgtgtctga 2100 



85/89 



wo 03/050253 

PCT/US02/39133 

SStS^ tlTcll'^lT. lliTtT^l ^^"^^^-^ 

atccagccgc tttggccgct gtatgggtca ctccaa^^h^ gtttccagtg atggtgccaa 2220 
caaggatggg aatttcatca tgtccalttc taaaat^lt^ atcactcatc ttgactggtc 2280 
ggctggaggc tgcaagcagc tgaagaatcg SaSaaoc actgggacgt 2340 

ctacacctgt gtgctgggct ttcacatnh? nr,t^^^ cgagaccggg aatgggctac 2400 
gccgcatgta cgggggSac ggcScaS t^^ ^'^^'^^f cgctcgtgcc aaggcgccga 2460 
cgcacctcgt ctcgSc Igcllaalca ca^a'^tfT. ^^^^'^'^^^^^ cacgacgact 2520 
ctgggggcgc ggggccjgcg cccacScoc ccagtggcga gtgctgggcg 2580 

cctccctcga cgtttgl??g ctScctaaca n^l ^^^t"" '^^'^^^'^^^t^ tcccccgcct 2640 
cccgccctgc ccttc?ctgg cSaatccS alTf.T ^ gcccggcggc gtggccccgc 2700 
gacttcgaga cattcccga? cgcStttt oohn^ ggccgactct ttcctggact 2750 

cactgtttag acccgctggc Sagccgaac lat^r-^r.^^^^" P^^^'^^^^^ cccctgcaca 2820 
ctgctgaggg gcaaLaa'L SaccSa^^ HTcHllll S^^^"^^ '^^^'^^^^^^^^ 



<210> 66 

<211> 11545 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> ndsc_feature 

<223> Incyte ID No: 7495594CB1 

<400> 66 



L'Siaagal Sga^^'ga gSSSga ^^T^^^ ^agagggaag 60 

ctgctgagga gtctggattg tqtcta^cr^ ^^f^''"^ ttccctccgg agttcccgag 120 
tggccgctgc cccgcjccca gctccgcgca cacS^^f tgaaagggcg ctgaggctct ISO 
ctcttcccgc ccgagccgcc atctaalntn ^Sf^ccctctg cgagtccggc cgcccagcgc 240 

tggcctcgcc tggccttttg ?tgc?S SSc^ ttlTa^^T ^^'^^'^^-^^ ^00 
cagcagatgt ccccgtcacCT caa^-^^^=^^ ctggcgctcg tttcgggctg ggcgaccttt 3 60 
gcccccggSa gtatccSgc Sgcccgct cctScaa^: ^ccccgagac cgcgcccggg 420 
gagcggctgg gccaggcgtt ccggcga^gc atacaactS^ aagcggcggg gagcagagtg 480 
ctggagcttg tcttcctggt gga?gattca trr-. J- l^ tgcgggagct cagcgagcgc 540 
gagctcatgt tcgtccgcaa allaclatrr n.^tf^ '''' gcgaagtcaa cttccgcagc 600 
gtggcoatcg tgacct?ctc Scca^aaac f^r.^l'"'^''^ tggtgcccac ggccaogcgc 560 
acccgccgcg cgcgccagca all^t^T ^t^'^l^^^^c cgcgcgtcga ttacatctcc 720 
taccSag^t? gcggca'cS ca'cSSggc gc^tt™ ^^Sc^f tgccatctcc 780° 
catgctagag aaaactcaac aaaa«^^^^= ^ aagccgcgca aattcttctt 840 

ggagacccta gacc^atSc aSactg cgaSattca^ f ^^ata ttccaatggg 900 
ggcatatggc aagggaacat tcgSaacta a«h™^!^^ ffagtggagat cttcactttt 960 
cactgttacc tgctacacag tStgaaoaa ttS^f cttccacccc aaaggaggag 1020 
gaagatctac cttctgggag ttttfttcaa aataat^f tagctcgccg ggcattgcat 1080 
gatgaaggca aggactgctg tgaccaaaf^ tccactgctc atatctttgt 1140 

cattttgagt gcatctgtga alagaaatat ^^^^^^^^ aatgtgggac acacacaggc 1200 
gcttgcccat cggggaca?a caaacc^aa^ aao?^^"^^^ gtctgcagta tgaatgcaca 1260 
ccatgtcctg atgaaaatca cacctctcca Itnc ^''^^ gaggaatcag cagttgcatt 1320 
tgcagagagg gaLcagggc atctggccaa Tcrfat^^^^ catcccctga agactgtgtc 1380 
aagcctcccg aaaatgg?ta cttta?ccaa ITalrlf^^" "^^^^-^^^tg ccctgccctg 1440 
tgtggggtcc gatgtcaccc tgg^ttSat cttat™ ^^aaccactt caatgcagcc 1500 
cccaatggtt tgtggtccgg t^cagagagc tJcgcSag Saaaa^'f f ^atgtcta 1560 
cgccagccga aacatggcca catcagctat te?™^^ taagaacatg tcctcatctc 1620 
tgtttggttg cctgtgatga agggtlcS Ctaaaaaa?^ aaatgttata taagacaaca 1680 
ggaaacagcc agtgggatgg gcJagaSccc tlnf^t^^ gtgataagct tacttgtcaa 1740 
cagatgccca aagSgtca? StSccccc caSaSta ^r^^'^"'^ ttccaccttt 1800 
gggacgatct gctatgtaag ttgccaccaa ™f gcaagcagcc agccaaattt 1860 

ctgagatgta ccaot^ctgj aaaatSat a?™^^^ ^^^""^^^^^^ caaagaaatg 1920 
gtggaggctc ctcaaatcaa ctgtcctaa^ ^^^^^ aggcagctgt gtgtaaagac 1980 
gattctgcca atgttacctg gcagattccf aStaala f ^^^^5*=' Sfgaacagcaa 2040 
tcagtccacg ttcatccagc ?ttcacccca cct?aS^^ ^^^^"^^^^^^Sr tgaaaaggtg 2100 
atcgtataca cggcaactga cctatccaar tt^^t tcccaattgg agatgttgct 2160 

gttattgatg cSaaccacc Scata?ac Katacaaa? ^^^'^"'^ ccatatcaag 2220 

yut.acagac tggtgcagat ctccacctcc cgtccaggtc 2280 



86/89 



wo 03/050253 

PCT/US02/39133 

liiitztii itziiis tf£iir.i 

ccgataat. otggaScS c^tacatM JSS?^ "agssattt tatatgcact 2520 

i= ~ i s a= iiii B 

tS a'SS cS?gS a'SSScI "-"SI 27."S 

Sii sii s= sSi iSi iiis 2i 

cggattaiaa gaagtgcccc att^^^t^:: tcggcaatgc caagtcctca 3000 

gctagtgtgc caSacccaa tl^t^nl^l^ tataaaatta agttaatttt taacatcaca 3060 

ctccttcaaa r^ll^^^^i tgaaagaaat gatacccttg aatgggaaaa tcagcaacga 3120 

atgtattSt tJcagcttgc atc^gSaS ^ttat'^"""" ^^actctcaa caa^gacccc lilt 

™S iiii — - s= ^^SS 

sss s= £ ~ ss::s g— 

aacatttcta catataltnt fl^tTt^ aacacctcaa ctgtgaaaag aggagccgtg 3600 
ccctg"acc catgtcctcg taactlttac f ^^^f^^f tctcgcgttc tgggttaatg 3660 
gcctgtccct tttatSac taccccattc gotaattcca IT^^t^^'' cttctgcctg 3720 

iiiiiisiiiisiiii 

=s ixi ai -T'"" sssco ^s^s 

si i?-.' SgSS a^^SSgc" L^JS 

S= a-L'SS = -SK" So" 

ssis a's^sl is HEi ~ »™' 

ttaaattcat acagttgtaa ftgtcaacca aggccacctg tgtggatgaa 4500 

cagtctacag gctttaacct aShfh^f ggattttcag gcaaaaggtg tgaaacagaa 4560 
gatggcatgc ?cccatctct ccata^S^« gtttctggca tctatggata tgtcatgcta 4620 
gaca?gaact atggaaScc aatScct^? aTfl^T '^'='^^^^^9^^ atcctctgac 4680 
ctcctgactg attataacgg ctgggttctt Tatataltll ""^^^^^^^^^ caataccttg 4740 
tgtccctcoa taaatcrat-S^ ^^^f? ^ tatgtgaatg gcagggaaaa gataacaaac 4800 
ggcatctgga 2gS?ata? cgSgaaaa tcacttggac aagtgccaat 4862 

ggtttgccia tacctggtgg tggSta gtggtgctgg cctctctgtt 4920 

gagggattca gccca^ctg? ScJftlJS aactc^f aagagcaaga caaaaaagga 4980 
tatatrohrrh ^t-^^=.^= gtcccccgtg ggctccataa gccagctcaa cctctgggac 5040 

si ~ HFiF» s?= ssfe si 

ss^gaS ."i£sc SE i™ ~" IJSS 

ctattgggtg actcaaggft gtaacctacc agtgcaacaa tggctactat 5460 

tcctgcc^tg a^gtcga^ga ctatSS^? ^ataatggga gctggaacgg cgtttcacca 5520 
ctgaacgtag atSatcSa ca^^^^f^ ggatcagatt gtagtgagca tgcttcttgc 5580 
aactgtgcag aacctatSa ^tlt^ t '^^^^^^^^^^^ cgtacacagg agatgggaaa 5640 
ggtgIgS? ltZlT.T9 tScgcStc IZlTtlaat -ggaaaatgg ccactcctca 5700 
atgggagtaa ccaaaatcar acattttcgt gtcaggaagg ataccagttg 5760 

tgEaLgctg tttcataSa tlttnl^^^f tctggagaat ggaatcatct aataccatat 5820 
gcatttact? tggScaa SSaca^^t ^Satatfata tgaggagtta 5880 

gataaagaat ca£cctgtct tgcSacagt IctlaalS.^ aaggatatac tctggccggt 5940 
ccagtgaagt g.tctagtcc gjaaa^ta^g T^l^f^l 1^ 
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llllttTall TcTclTotl: JgStcS ^^^^^^^^^ acagcttaca gggcccttcc 6120 
ttctgtSgag Lcclcctgc cScaSSt T.tTT.^ cacctgcctg tcacctcgtc 6180 
aggaacaccg tcacttacac t?acaaaa^^ gctgtcatta cggggaataa cttcactttc 6240 
gaatgcctgj ccgacggcaa gtgg^Jtaaa latlt ""^^ ttgctggtct tgacaccatt 6300 
gatgagccac ccattg?gga ccacgcctct ccaaa.^.r ^^^^^"^^^^ tgtctcctgt 6350 
attgcattct actactgctc taltaatltn ^"^^^f^^-^tg cccatcggct ctttggagac 6420 
aatgcccagg gcaagtgggt acccccSa aaS^^""^^ acaattccca gcttctctgc 6480 
ttctgtgaaa aacctccKc gSttcSJ? laS^.fr ^^^f^^^^^^g tatagctcat 6540 
tttgcagctg gctcagttgt gfgctttaaa tla^ln^^^^ aatctgtgag caaagcaaaa 6600 
gcaaagattg aatgtatgag lallaaat^n ^^^^"^^^^f^ gctttgtact gaacacctca 5660 
atccctgtgc ggtgtggaga SScSg? Xc^tg^ata TatT''"" -^^--^^gc 6720 
tacagttttg gagccatggt actcttZrt^n f^""^^^^^^^ gctatgcaag tggatcaaac 6780 
aagaagagca calgcTeSc ?acSgacaa taatTT ^f'^^t^'^^t caaaggggaa 6840 
gtatcttgtg gtgaaccacc taaggSag Sct^tc f "^^'^^'^ ^Ttgccacccg 6900 
atctttgaga gtgaagtgag gta?cag?a? lllaralnr-t tggagcatac aactggcagg 6960 
gtatttgtct gccaagccaa tcgaclctla atr.nf^tl ataagtcagt cggaagtcct 7020 
ctcgactgtg gaaaacctcc cccgatccag aatgacttca ^r"'*'*'^^^'' Stgtgttcct 7080 
gtagggtcca aggttcagtt tttctataa? tgaaaggaga aaactttgaa 7140 

tggacatgtc agaaatc?gg caaataaS tlf.^^ l^^^ agcttgttgg tgacagttct 7200 
aagtgcccag agccgcccc? cttggaaaac caar^!^^^^ atccaaagtg catgcctgcc 7260 
gtaggagttg tgacattttc c^gSaagaS gggcatatcc taof gaccaccgag 7320 

aaatgcttgc catcccagca atggaaSac tctt^S^^S tgcaaggccc ctctgtcctg 7380 
accccaactc ccctaatStc ctSgSgtc Sattcc^? r^ltTT'' ^^"ctttgt 7440 
agtactgtca agtattcttg tgtaggtggg tttSccta^ cttctgctct tcattttgga 7500 
tgccaacctg atggcacctg gfgc?ctcca ctJccaS? taccaccctc 7560 

cacctgagga atccccatgg aatcattgat atao^™^ gtgttccagt agaatgtccc 7620 
ctctatacct gcaagccagq ctttaaaff^ ^tgcaaggcc ttgcctatct cagcacagct 7680 

aatggtcact ^ct^gga^^ T.^^lTcllol gSSgSa t^'gaSaS ^^^^^^^^^ ™ 
gagattttga atggcaaatt ctr■^^pn=,^^ f ^ ? ttgagtgcct gaaacccaag 7800 
tcttgcaacc gaggc^^tc^ gctcgaaaa? ^r^^^f^^^ f^gacagac cgttacctac 7860 
gattgggatg tagatgcccc ItcttgcSt acc^f^^''^ ^^''''^^"'^ agagacaggfc 7920 
gaaaatggtt ttgtagaagg tgcaSt^ac f Sr^gattcccc acaacccatt 7980 

ttccctgggt ttcagltgg? tggtcaSc Sacaafcc? T^^^^""^^ ctacagttgc 8040 
agttccatcc caacatgtat gccaataafr ^^^^ f gtgaagagtc aggatggtca 8100 
gactgtacta aactcaSgJ ?gaJcaaaaa ctcctcatat agattttgga 8160 

gttccatatg tgactcct?a ccctcc??at catttJaaf "^^^^^^^^^ catgatggaa 8220 
aatacaaagg agtctcctgc tacacattc^ tcaa2??tc tTf^T^ aacctgggaa 8280 
tacacctgta atccaggata taaact^r^^ ^^S! 5 tgtatggtac catggtttca 8340 
ggaacttgga atggcfgtgc accatcctac Jft f.^ tgctgatctg ccaggaagat 8400 
cctgaaaatg gctttttgcg ttttacaS^ atttcaattg aatgtgactt gcctactgct 8460 
tgtaaacctg gacac^Sc? agcagg^^S glltT£ll ftt^lT^'' gcagtatagc 8520 
tggagtggtg cctccccacg cSgtgaagcc atttcatar« "'^^'^^^^Sra gaatagaaag 8580 
aatggatcca tcaaaggaag caactacica ^ aaaagccaaa tccagtcatg 8640 

cccggatatg tgctgfSgg caStgaSgg Saacatacc f ^^^^ tgagtgtgac 8700 
gaggatgagc ccatttgcat tcctSgSc tart^flT aggatgacaa aaactgggat 8750 
caggtgagag gagacglgta cac2^Sa afagStta aa^^^'^.'" agccaatggc 8820 
ttcttgcttg agggagccag gagtcgggtt tatcSo^n» aatacacttg caatgaaggg 8880 
actcccgact gtgtgcctgt laSSS" accccScac L^f^^"^ gagtggagcc 8940 
gaaggcctgg actatggctt catgaaoaaa attln^f^ aactggccaa tggggtgacg 9000 
ttgcacggtg ctccalaact caccta?caa tr-^^tt ^^^gtcacga gggctacatc 9060 
ctctgtaaac cagtcaactg tggacctcc? SSatc?ta ^^""^f^^^f^ agagattcct 9120 
ttttccttta ttcatggggg ccltat^ag ITtTagtlat ttccSSJ^ ^'^^f'^^*^ 
ggaaattcat caagaaggtg cctctccasf nr,r.^^ I ttcctggtta taagctccat 9240 
ctgccttgca gatgttcSc accagtaatt gaaSaa olT'T"" ^^="<=<=t^- 93 00 
gactgtggaa aggcagcccg aattcaatao ft^lt^^ t ^'^^^caatgg gacagatttt 9350 
gaaatcacct ggaagccga ^gjccagtgg aTctataaal .^"""^^^^^^ aggactttct 9420 
tcttgtggtt ctcttccalt glllZlllf aaaltltf^ tcccccactg tgaacacact 9480 
gaaaatgtga taacttacag c?gcagg??t SaSa^^^ srtgagaccag ctcttggaag 9540 
atttgtacag agaaaggggt atggaSaJ cStatccaa trt^t^^""^ ttcagatctg 9500 
SSfa- ~S alSSL- -Saaal.^ .MS 

tgtcagaaag atggecjctg S^g g^Sfa L^^t^^^^ lalS 
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ctcccggaaa acataacaca tatacttgtt catggggacg atttcagtgt gaataggcaa 9900 
gtttctgtgt catgtgcaga agggtatacc tttgagggag ttaacatatc agtatgtcag 9960 
cttgatggaa cctgggagcc accattctcc gatgaatctt gcagtccagt ttcttgtggg 10020 
aaacctgaaa gtccagaaca tggatttgtg gttggcagta aatacacctt tgaaagcaca 10080 
attatttatc agtgtgagcc tggctatgaa ctagagggga acagggaacg tgtctgccag 10140 
gagaacagac agtggagtgg aggggtggca atatgcaaag agaccaggtg tgaaactcca 10200 
cttgaatttc tcaatgggaa agctgacatt gaaaacagga cgactggacc caacgtggta 10260 
tattcctgca acagaggcta cagtcttgaa gggccatctg aggcacactg cacagaaaat 10320 
ggaacctgga gccacccagt ccctctctgc aaaccaaatc catgccctgt tccttttgtg 10380 
attcccgaga atgctctgct gtctgaaaag gagttttatg ttgatcagaa tgtgtccatc 10440 
aaatgtaggg aaggttttct gctgcagggc cacggcatca ttacctgcaa ccccgacgag 10500 
acgtggacac agacaagcgc caaatgtgaa aaaatctcat gtggtccacc agctcacgta 10560 
gaaaatgcaa ttgctcgagg cgtacattat caatatggag acatgatcac ctactcatgt 10620 
tacagtggat acatgttgga gggtttcctg aggagtgttt gtttagaaaa tggaacatgg 10680 
acatcacctc ctatttgcag agctgtctgt cgatttccat gtcagaatgg gggcatctgc 10740 
caacgcccaa atgcttgttc ctgtccagag ggctggatgg ggcgcctctg tgaagaacca 10800 
atctgcattc ttccctgtct gaacggaggt cgctgtgtgg ccccttacca gtgtgactgc 10860 
ccgcctggct ggacggggtc tcgctgtcat acagctgttt gccagtctcc ctgcttaaat 10920 
ggtggaaaat gtgtaagacc aaaccgatgt cactgtcttt cttcttggac gggacataac 10980 
tgttccagga aaaggaggac tgggttttaa ccactgcacg accatctggc tctcccaaaa 11040 
gcaggatcat ctctcctcgg tagtgcctgg gcatcctgga acttatgcaa agaaagtcca 11100 
acatggtgct gggtcttgtt tagtaaactt gttacttggg gttacttttt tttattttgt 11160 
gatatatttt gttattcctt gtgacatact ttcttacatg tttccatttt taaatatgcc 11220 
tgtattttct atataaaaat tatattaaat agatgctgct ctaccctcac aaaatgtaca 11280 
tattctgctg tctattggga aagttcctgg tacacatttt tattcagtta cttaaaatga 11340 
tttttccatt aaagtatatt ttgctactaa ataatatttt gctggatagt accattttat 11400 
gaggtggcca agggattcat agagaggact catgatcttt catctgtgcc actggcacca 11460 
cataggagac cccttacaat aaaggagacc cctttgttct ggttccatct tctatacaat 1152 0 
atgtgtaata ataaaagctt tttgta 11546 
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